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Abstract 

A branching random tessellation (BRT) is a stochastie process that transforms a 
coarse initial tessellation of M.'^ into a finer tessellation by means of random cell divi- 

^T) • sions in continuous time. This concept generalises the so-called STIT tessellations, 

for which all cells split up independently of each other. Here we allow the cells to 

^^ ' interact, in that the division rule for each cell may depend on the structure of the 

surrounding tessellation. Moreover, we consider coloured tessellations, for which 
each cell is marked with an internal property, called its colour. Under a suitable 
condition, the cell interaction of a BRT can be specified by a measure kernel that 
determines the division rules of all cells and gives rise to a Gibbsian characterisation 

0^ , of BRTs. For translation invariant BRTs, we introduce an 'inner' entropy density 

relative to a STIT tessellation. Together with an inner energy density for a given 

^> , 'moderate' division kernel, this leads to a variational principle for BRTs with this 

n , ' prescribed kernel, and further to an existence result for such BRTs. 
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tessellation; Gibbs measure, relative entropy, free energy, variational principle. 
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This article is dedicated to the memory of the second author, Tomasz Schreiber. 
Shortly before his untimely death of cancer at the age of 35, he entrusted to the 
^ ' first author a draft manuscript that introduced the general setup of this paper and 

initiated a study of relative entropy densities for BRTs. This paper tries to realise 
his intentions with various modifications of concepts and results. 
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^ ■ 1 Introduction 

m . 

A central object of stochastic geometry and spatial stochastics are tessellations of M 
(with d > 1), i.e. locally finite families of d-dimensional convex polytopes that cover 
W^ and have pairwise disjoint interiors. They are used in many practical applications. 
For example, random tessellations serve as models for cellular or polycrystalline mater- 
^ _• ials, plant cells or influence zones, for instance in the modelling of telecommunication 

networks or animal territories; see [14^ [M] for an overview. 

The standard random tessellations usually considered in the literature are the Pois- 
son hyperplane tessellations, the Poisson-Voronoi and the Poisson-Delaunay tessella- 
tions; cf. [T7] for definitions. These have the the property of being facet-to- facet (or 
side-to-side in the planar case), which is to say that the intersection of any two of its 
cells is either empty or a common face of both cells. However, there are numerous ap- 
plications for which models of this kind are inappropriate, for example network models 
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for telecommunication systems or models for crack structures in geology. Hence, there 
is a growing demand for mathematically tractable models of non-facet-to-facet tessella- 
tions, which may serve as idealised reference models. Only some years ago, the class of 
iteration-stable random tessellations (called STIT tessellations for short) was introduced 
by Nagel and Weifi in |13j . These tessellations are constructed by means of a random 
process of cell division and have attracted considerable interest because of its analytical 
tractability; see for example [15], [l8]-[23] or [25] , 

Our objects of study here are generalisations of the STIT models. On the one hand, 
we consider coloured tessellations, for which each cell is equipped with an individual 
colour. For example, the colour of a cell could represent its nutrient content, its gen- 
otype, age, or whatever else might be relevant to describe the state of a cell. (In a 
different context, coloured tessellations have been studied by Arak and Surgailis [Il[2], 
for example.) On the other hand, and more importantly, we allow for an interaction 
of cells during their division process. Informally, the procedure is the following. At 
time zero, one starts with an initial random tessellation of M into coloured cells. Each 
cell lives for a random time, which is determined by an interactive competition of cells. 
Namely, the survival rate of a cell c at any time s > may not only depend on the cell's 
geometry and colour, but in fact on the whole tessellation including its past evolution. 
When the lifetime has run out, a hyperplane with coloured half-spaces is chosen ran- 
domly according to some rule that may again depend on the cell's geometry, colour and 
the past evolution of the surrounding tessellation, and is used to cut c into two poly- 
hedral sub-cells c"*" and c~ , which inherit their colours from the respective half-spaces 
of the cutting hyperplane. The daughter cells c"*" and c~ then replace c in the collective 
division game, which is continued until time 1, say. The resulting tessellation of R at a 
deterministic time s > is denoted by Tg, and the tessellation- valued stochastic process 
(Ts)sgro^i] is what we call a branching random tessellation or BRT for short. 

In the special case when (i) the distribution of lifetimes is exponential with para- 
meter proportional to the mean width of the cells, and (ii) the bicoloured hyperplanes 
are chosen at random according to the motion-invariant hyperplane measure and some 
reference measure on the colour space, (Ts)^g[o,i] is a coloured STIT tessellation of M*^ 
and its distribution is invariant under rigid motions whenever so is the initial random 
tessellation. The coloured STIT tessellations play an important role in the background 
of our theory, but our focus is on cell-splitting mechanisms with interaction as described 
above. Different tessellation models with cell interaction, namely Delaunay or Voronoi 
tessellations of Gibbsian type, are discussed in [3l[5l[6]. 

The main results of this paper are the following. 

- First, we investigate the behaviour of general BRTs, typically denoted by P. Under a 
condition of local absolute continuity (LAC), we establish the existence of a measure 
kernel ^ that captures the behaviour of P in two respects. On the one hand, it 
describes the evolution in infinitesimal time by fixing a cell splitting rule. On the 
other hand, it specifies the spatial interdependence of the cells during their overall 
evolution. The latter gives rise to a Gibbsian description of P, in that $ determines 
the conditional distribution under P of the evolution of all cells within any bounded 
window when the evolution of all other cells is given. 

- We then turn to a kind of thermodynamic formalism for BRTs P that are invariant 
under spatial translations. The basic quantity is an inner entropy density /i™(P), 
which is defined as the limit of a conditional entropy per unit volume of P relative 
to a reference STIT model. The adjective 'inner' refers to the fact that only the 
cells completely inside the respective window are taken into account, rather than all 
cells that hit the window. The functional /i™ will be shown to share some familiar 



properties of the entropy densities for the standard models of statistical mechanics, 
at least with some natural adaptations. 

- Finally, we consider an arbitrary division kernel ^ that satisfies some mild regularity 
conditions, which roughly require that ^ is not too far from a STIT kernel; such a ^ 
will be called moderate. We introduce an associated inner energy density u^^(P; ^) 
as well as some sort of pressure f™(P;^). The resulting inner excess free energy 
density /i™(P; ^) gives rise to a variational principle, which states that the minim- 
isers of /i™( • ; ^) are precisely the translation invariant BRTs that admit ^ as their 
division kernel. It is further shown that such minimisers do exist, for any prescribed 
distribution P of the time-zero tessellation. This proves the existence of a BRT P 
for any given initial distribution P and any moderate division kernel ^. For general 
^, such a P is not necessarily unique. 

The paper is organised as follows: Section [2] introduces the setup and recalls some 
necessary facts. Besides tessellations and BRTs, the main concepts are division kernels 
and local conditional BRTs of Gibbsian type. This section also includes some examples 
of division kernels to which our theory applies. The main results together with their 
framework are stated in Section [3l These are Theorem 13.11 on the existence and signi- 
ficance of global division kernels. Theorems 13.51 and 13.61 on the existence of the inner 
entropy density and its properties, and Theorems 13.91 and 13.101 on the variational char- 
acterisation and the existence of invariant BRTs with given moderate division kernels. 
All proofs are collected in the final Section [H 

2 Preliminaries 

2.1 Tessellations 

2.1.1 Polytopes and tessellations. Consider the Euclidean space M of arbitrary dimen- 
sion d > 1. We shall deal with certain random processes of coloured tessellations of M*^ 
into (coloured) convex polytopes. Let us specify these terms. First, a polytope p in M is 
the closed convex hull of a finite set of points and is always assumed to have non-empty 
interior; the set of all such polytopes is denoted by P. Each polytope p G P is equipped 
with a translation covariant selector m{p), called its ^ centre^ or ^ midpoint, for example 
its barycentre, its Steiner point or its circumcentre. We write r{p) = uiaxx^p \x — m(p)\ 
for its radius and dp and int(p) for its topological boundary resp. interior. 

More generally, we will assume that each polytope is marked with some internal 
property, called its colour. So, we fix an arbitrary Polish space S, which we call the 
colour space. A coloured polytope, called cell in the sequel, is a pair c = {p, a) with p € P 
and cr € S. Let us denote by sp(c) := p and col(c) := a, respectively, the spatial part and 
the colour of c. The space of cells is thus C := P x S. To simplify notation we adopt the 
general convention that spatial operations on cells (and also on coloured tessellations 
defined below), such as intersections with subsets of M°' and translations, solely refer 
to the spatial part and do not affect their colours. For example, m(c) := m(sp(c)), 
r{c) := r(sp(c)), int(c) := int(sp(c)), cCiW := (sp(c) fl W,col(c)) for W C M'^, and 
c — X := (sp(c) — a:;,col(c)) when x € M.'^. Finally, vol(c) := volrf(sp(c)) is the {d- 
dimensional) volume of the spatial part of c. Let us also define the space Co = {c € C : 
m{c) = 0} of cells having their midpoint at the origin. 

The cells are the constituents of the coloured tessellations which we introduce now; 
for brevity we will omit the adjective 'coloured' in the following. (Note that letting 
S be a singleton one recovers the uncoloured case usually considered in the literature; 
cf. [IZIIM].) 

Definition 2.1. A (coloured) tessellation T of R*^ is a countable subset of C such that 



• r is locally finite, in that any bounded subset of W^ only hits a finite number of 
cells from T, 

• two distinct cells of T have disjoint interiors, i.e. int(c) nint(c') = for all c,c' (z T 
with c ^ c', 

• the cells cover the whole space, which is to say that |J c = M . 

ceT 

The space of all tessellations of M*^ will henceforth be denoted by T. 

Besides tessellations of M , we will also consider tessellations in local windows W C 
W^, which will generally be chosen to be polytopes, or sometimes also a finite unions 
of polytopes. So, we write Py for the set of all finite, not necessarily connected unions 
of polytopes, and for W € F^ we let Ciy be the set of cells that are contained in W. 
We finally write T\y for the set of all tessellations of W, that is, of all finite collections 
{ci, . . . , Cn} of cells with pairwise disjoint interiors and such that ci U . . . U c„ = W. 

2.1.2 Measurability. We need measurable structures on all spaces introduced above. 
We start with the space P of polytopes. As the sets in P are compact and non-empty, the 
natural metric on P is the usual Hausdorff distance dn', cf. J17l Chapter 12.3]. Hence, 
the space P can be equipped with the Borel c-field B{¥) induced by dn- In fact, B{¥) 
is generated by the sets {p € P : p n S / 0} with B € S(M'^), the Borel a-field on R'^; 
see [T71 Chapters 12.2-12.3]. The coloured counterpart C is endowed with the product 
a-field ^(C) = B{¥) O B{T.), where B{T.) is the Borel a-field on S. The space Co of 
centred cells receives the trace c-field. 

We next need to introduce a suitable cr-field on T. As is usual in point process 
theory, we let B{T) be the cj-field generated by the counting variables 

(2.1) iVA :ir^Nu{+oo}, T^ jm^l, A £ B{C), 

where | • | stands for the cardinality of the argument set, i.e. N^ counts how many 
cells of T belong to A. In view of the structure of B{C), B{T) is also generated by the 
random variables 

Nb,s ■.T3T^\{ceT:cnB^$, col(c) e S}\, 

with B a bounded Borel set in W^ and S € B^E). Moreover, B{T) is the Borel u-field 
for the vague topology on T, which is generated by the functions 



egiT^iCoo), T^^5(c), 
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where 5 > is a continuous function on C with a bounded support in the spatial 
coordinate; see |101 Appendix 15.7] or |1H Theorem A2.3]. 

Definition 2.2. A probability measure P on {T,B{T)) satisfying the first-moment con- 
dition J P{dT) \Tw\ < 00 for all windows VF S P is called a random tessellation. The 
set of all such P is denoted by ^{T). 

To deal with local properties of tessellations we will often restrict a tessellation to a 
local window M^ € P. We thus define the projection to such a TV by 

(2.2) TTw-.T^Tw, T^Tw:={cnW -.ceT, int(c n P^) / 0} . 

In the same manner as above, we may introduce a cr-field B{Tw) on Tw- One can then 
easily check that the mapping ttw is measurable. 



2.2 Branching tessellations 

2.2.1 Cutting cells by hyperplanes. We now turn to the main objects of our investigation: 
tessellations which arise from a given initial tessellation by a successive splitting of cells 
into two pieces by means of suitable hyperplanes. Recall that a hyperplane rj with unit 
normal u € S^~ (upper unit half-sphere) and signed distance r € M to the origin can 
be written in the form r] = {x £ M. : {x,u) = r}, where (•, •) stands for the usual 
scalar product. So, the space of hyperplanes can be identified with S^~ x M. For r/ 
as above, we write rj^ = {x £ M. : {x, u) > r} and 77" = {x G M : (x, u) < r} for 
the associated half-spaces. More generally, we consider bicoloured hyperplanes H = 
{r],a~^ ,a~) £ M := §.^7 x M x S^, for which each of the half-spaces r] is equipped 
with a colour a^. We write sp{H) := rj and col {H) := a^, respectively, for the spatial 
part and the colours of H and again adopt the convention that spatial operations with 
bicoloured hyperplanes only refer to the spatial part, for example, cCiH := sp(c) nsp(if) 
or c n H^ := (sp(c) PI sp{H)^ , col (H)) for any c G C. Moreover, for such a cell c, we 
let 

(2.3) {c) = {H £M:Hn int(c) / 0} 

be the set of all bicoloured hyperplanes, the spatial part of which hits the interior of the 
spatial part of c. Each bicoloured hyperplane H defines a cell division operation on 
tessellations. Namely, let T G T, c G T and H £ (c). Then is defined by 

(2.4) 0e,H (T) := (T \ {c}) U {c n i/+, c n if ^} 

with cCiH^ as above. Branching tessellations are now defined as follows. For simplicity, 
the time interval will mostly be the unit interval [0, 1]. 

Definition 2.3. (a) Let VF G Py be a finite union of polytopes. A branching tessellation 
in the window W with bounded time interval / = [a, b) or [a, b] is a family T = (Ts)sei 
of tessellations in W such that 

• the function s ^ Tg from / to T^y is piecewise constant, right-continuous and has 
only a finite number of jumps, 

• at each point s of discontinuity (so that T^ ^ Tg^ := liuir-^sTr), there exists a 
unique cell c £ Ts- and a bicoloured hyperplane H £ (c) such that 

Ts = 0c,h{Ts-). 

Further, T^ is called the initial tessellation. We write IBT^y for the set of all such 
branching tessellations in W. 

(b) A family T = (Ts)o<s<i is called a branching tessellation in M^ if for each 
window W £ F the restricted process T^ = 7rvK(T) := (7rvi/(T's))Q<^<]^ is a branching 
tessellation in W. Again, Tq is then called the initial tessellation of T. The set of all 
branching tessellations in W^ is denoted by BT. 

The following remark provides a further way of describing the time evolution of a 
branching tessellation. 

Remark 2.4. (a) Let T be a branching tessellation in a window VF G Py with time 
interval I = [0,1]. (The case of other time intervals is similar.) Keeping record of all 
jump times of T together with the associated cells that are divided and the respective 
cutting hyperplanes, one arrives at the set 



^(T) = <^ (s, c,H) £ {0,1] xCxM: T,_ ^ T,, 
(2.5) ^ 

c G T,_, H £ (c), Ts = 0c,HiTs^)\ 
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Figure 1: Representation of a two-coloured branching tessellation in a finite window 
with initial tessellation Tq = {c, c'}. The cells living at a time s constitute a tessellation 
Ts- At each time Si, a cell that lives up to this moment is selected and cut in two by a 
bicoloured hyperplane Hi, which impresses its colours onto the cell's pieces. 



of all 'division events'. There is a one-to-one correspondence between T and the pair 
(Tq, &{T)), in that T can be recovered from the initial tessellation Tq and the set ^(T) 
of division events. Indeed, labelling the elements of ^(T) with the indices 1, . . . ,n := 
|^(T)| according to the order of their time coordinates so that =: sq < si < . . . < 
Sn < Sn+i '■= 1, one has the recursion Tg = Tq for s G [0, si) and 

Ts = 0ci,Hi(Ts,_j^) for s G [si,Si+i), i=l,...,n. 

Finally, Ti =Ts„. 

This description also gives rise to a convenient way of visualising T as a graph in 
[0, 1] X Cw'i see Figure [TJ The set of vertices is 

r(T) = {(0, c) : c G To} U {{s, cHH^): {s, c, H) G ^(T)}. 

Moreover, each (s,c) G 'f{T) is equipped with a 'lifeline' [s,s*) x {c}, where s* = s' if 
{s',c,H) G ^(T) for some s' > s and H G (c), and s* = 1 otherwise. If s* < 1, this 
lifeline is augmented by the lines from (s*,c) to the two children (s*,cn H^) of {s,c). 
If s* = 1, the half-open line [s, 1) x {c} is replaced by the closed line [s, 1] x {c}. In this 
way one obtains a finite forest of binary 'family' trees in Cw that evolve from the cells 
of Tq. So, these cells are the roots, or ancestors, and the |ro| + |^(T)| leaves form the 
tessellation Ti . This branching mechanism is strongly reminiscent of the fragmentation 
processes considered in [1]. 

(b) Branching tessellations in the whole space W^ admit a similar description in terms 
of division events. For each T G BT we can then define 



(2.6) 



vev we¥:WziV 



Conversely, for each VF G P one can recover the division events in W from &{T) via 

&{Tw) = \^{s,cnW,H) : {s,c,H) G ^(T), He (cRW^)}. 

It follows that T is uniquely determined by Tq and ^(T), and T can be regarded as a 
forest of infinitely many finite binary family trees of coloured cells, the roots of which 
correspond to the cells of the initial tessellation Tq of W^. 



Later on, it will be essential for us to keep track of the past of a branching tessellation. 
So, instead of considering the evolution T = {Ts)o<s<i in T, we will consider the process 
(Ts)o<s<i in BT, which is given by T^ = {Tu)o<u<s- Equivalently, T^ can be thought 
of as being obtained from T by removing from ^(T) all elements with time-coordinate 
larger than s. In this way, each T^ can be considered to be an element of BT, which is 
frozen at time s (and thus remains constant thereafter). The set of all such branching 
tessellations is denoted by BT^. In particular, BTi = BT, and BT„ C BT^, when u < s. 
We write 

(2.7) TTs : BT ^ BTs, T ^ T^, 

for the natural projection that removes the division events after time s. As before, 
the non-bold Tg stands for the tessellation at time s, whereas a bold T^ stands for an 
element of BT^. 

Besides this projection concerning time, we have also the projection to a spatial 
window VF S P, which is given by 

(2.8) TTiy : BT ^ MTw, T ^ Tw = {Tw,s)o<s<i with Tw,s = ^w{Ts), 

where MTw = t^w(W^) and -kw is as in ()2.2p . We also write Trw,s = '^w ° '^s, '~^w,s = 
7rvi/,s(T) and BTvi/^s = 7Vw^s{MT). So, to obtain Tw,s from T one has to remove from 
&{T) all division events with a time coordinate exceeding s or a hyperplane not hitting 
the cell's intersection with W. 

2.2.2 Branching random tessellations. Our main objects of interest are probability 
measures on BT. So we need to equip BT with a a-field. We know from Remark 12.41 
that each T € BT is uniquely determined by its initial tessellation Tq together with the 
set ^(T) of division events as given by (|2.5p and ()2.6p . Since ^(T) is a locally finite 
subset of (0, 1] X C X H, one can proceed as usually in point process theory by defining 
B = BiWf) as the smallest cr-field for which the counting variables 

(2.9) ^A,B-T^^\Tor\A\ + \S){T)r\B\ 

with A G B{<C) and B G 0((0, 1]) ® B{C) ® S(H) are measurable; here ^((0, 1]) denotes 
the Borel cr-field on (0, 1]. By standard theory, (BT, B) is a Borel space. For any window 
W G P, we define a cr-field Bw = B{WIw) on BT^ in the same way. To simplify notation 
we will not distinguish between the cr-field Bw on MTw and its preimage tv^ Bw on 
BT, which will be denoted by the same symbol. Anyway, with these definitions it is clear 
that both the projection ttw in (|2.8p and the time restriction map tt, : (s, T) i— )■ T^ of 
(j2.7p are measurable. 

Definition 2.5. A branching random tessellation (BRT) of W^ is a probability measure 
P on (BT, B) satisfying the first-moment condition 



(2.10) / P(dT) \Tw,i\ < oo for aU windows W G 



The set of all such BRTs of W^ is denoted by ^ = ^(BT). BRTs within a window 
Vl^ G Pu are defined analogously. 

For every P G ^ and any of the projections tt* in (|2.7p and (|2.8p we write P* = 
P o TT^T^ for the image of P under tt*. In particular, each P^ is a BRT. In fact, one can 
achieve that P^ depends measurably on s, in that the mapping [0, 1] x B 3 (s. A) i— )• 
Ps{A) is a probability kernel, as will be assumed throughout the following. This can be 
seen by disintegrating the measure 

(2.11) P:=^ dsJpidT)6^s,T.) 



on BT := {(s.T^) : s G [0, 1], T, G BT^; cf. [iQi Appendix 15.3]. Later on, we will also 
consider the projections Jvw = id (8" t^w that act on the second coordinate of BT as in 
(j2.8p and leave the first coordinate untouched, and the projection images Pw = ^°^w ' 
where VF G P. We also introduce the notation BT^y := 7fvi/(BT). 

2.3 Division kernels 

Consider a random element T of MT\y for a window W G Py The process (Ts)o<s<i is 
then automatically Markovian because its 'past' is part of the 'present'. In this paper 
we will focus on the 'nice' case in which the evolution of this Markov process is described 
by a rate kernel that specifies the jump times and transitions of (Ts)o<s<i. Since the 
only transitions are single-cell divisions by bicoloured hyperplanes, this means that the 
rate kernels take the following form. 

Definition 2.6. A division kernel is a measure kernel <1> from the set 

{(s,Ts,c) gIT X C: cGT^ 

to HI such that each <I>(s,Ts,c, •) is a finite measure supported on (c) C H. If $ is 
only defined for arguments in BTyi/ x Cw , ^ is called a division kernel for the window 

ly GPu- 

In the following, it will be convenient to work also with the cumulative division 
kernel 

(2.12) |.(s,T„-) = J^'^c®^(s,T„c, •) 

cGTs 

from IT to C X M. Note that, conversely, $(s, T^, c, • ) = ^{s, T^, {c} x • ). 

The next remark describes how a division kernel determines the evolution of a BRT 
within a bounded window. 

Remark 2.7. Local BRTs with prescribed division kernels. Let W & ¥\j he a fixed 
window, ^w be a division kernel for W, and (l)w{s,Ts) := ^w[s,Ts,Ts x (W)) the 
finite total mass of the cumulative kernel ^vy(s, Tg, • ). We construct a random element 
T of MTw as follows: 

(I) Pick an initial tessellation Tq G Tw according to some probability law Pw on Tw, 
and let sq = and Tq := Tq. Also, let i = 1 and proceed with the following 
random recursion over the number i. 

(R) Suppose that z > 1 and both a random time Sj_i G [0, 1] and a BRT Ts._j G BT^..^^ 
are already realised. Then take a random time Si G (sj_i,oo] with 'survival' 
probability 



(2.13) Prob(si > s) = exp -/ (f)w{u Al,Ts,_^)du 



for s > Si-i. 



If Si < 1, proceed to define an extension T^^ G BT^. of T^.^ as follows: Pick a 
random cell Cj G Tg.^ and a bicoloured hyperplane Hi according to the law 

^w{si, Ts^_i, •)/<^w/(si,Tg,_i). 

(Note that the denominator does not vanish for each possible choice of Sj.) Then 
let Ts = Ts^_-^ for s G (sj_i,Si) and Tg^ = 0c„HiiTsi_-^), that is, 

^(T,J = ^(T,^_JU{(s„c„F,)}. 

Next let i := i+1 and goto (R). In the case Sj > 1 let Tg = Ts^_j for s G (si_i, 1], 
set n = i—1, and stop. 



One needs to ensure that this algorithm terminates after finitely many steps. It is 
not difficult to show that this is the case if 

(2.14) sup <^w{s,Ts,c,{c)) =:(!)< oo; 

s,Ts,c 

see the proof of Lemma [4. 31 below. This lemma shows further that the process (T<()o<s<i 
can be characterised as the unique, in general time-inhomogeneous pure jump (i.e. piece- 
wise constant) Markov process in BT^k with initial distribution Pw and generator 

(2.15) l.^lg{Ts)= f l>H^(s,T„d(c,if)) [<7(0s,c.i^(T.))-9(T,) 

JTsXiW) L 

at time s G [0, 1]. Here, (Z)s,c,h{T^s) G BT^ 

is the branching tessellation that coincides with T^ for times less than s and equals 
0c,h(Ts) at time s, and g is any bounded measurable function on BT^i/. The distribution 
of T is a BRT Pyy in VF € P, and this Pw is called the BRT in W with division kernel 
$VF o-''^d initial distribution Pw- 

2.4 Examples of division kernels 

This section contains a few examples of division kernels; two simulation pictures are 
shown in Figure [2l The first is (by now) classical and will be used as a reference model 
throughout the following. 

Example 2.8. STIT tessellations. Let A be a locally finite measure on the set H of 
all bicoloured hyperplanes, which is invariant under all translations. That is, under the 
identification of ii' G HI with {u,r,a^ ,a~) G S^JT^ x M x S^, A can be written in the 
form 

(2.16) A(di?) = A(dii) dr /x(u, dcj+, do-"). 

Here, A is a measure on S,,^ , and /i is a probability kernel from S^~ to T?. (The 
translation invariance is expressed by the fact that the r-marginal is Lebesgue measure 
and /i does not depend on r.) A natural choice is the motion invariant measure Aiso 
for which A is the normalised surface measure Aiso on 8.^7 and /i(u, • ) = v ® v for a 
reference probability measure u on S. Then, a STIT tessellation with driving measure 
A is a BRT for the division kernel 

(2.17) A*(s,T„c, •) = A(- n(c)). 

In the uncoloured case, this model has been introduced by Mecke, Nagel and Weif^ 
[12l [13]. Since A* does not depend on the time s, the random holding times Sj — Sj„i in 
Remark 12.71 above are exponentially distributed and can be understood as minima over 
c G Ts^_^ of independent exponential times with parameter A((c)), which are associated 
to the presently existing cells. (In the isotropic case A = Aiso, the parameter A((c)) 
is precisely the mean width of c.) In other words, the tessellations evolve according 
to a continuous-time branching process on Cvkj VF G P, in which all cells c behave 
independently of each other, live for an exponential time with parameter A((c)) and then 
split into two parts according to the conditional distribution A( • |(c)). In particular, this 
implies that smaller cells live stochastically longer. 

In view of this independence of the evolution in different cells, it is clear that for 
each To G T there exists a unique whole-space BRT 11 (Tq, • ), called STIT tessellation 
of ffi , with initial tessellation Tq and driving measure A. In fact, if the support of A 
contains a linear basis of W^, one can also construct a unique BRT 11'°° = 11 (T(^, • ) 



for the degenerate initial tessellations Tq° that consist of the single 'cell' M"^ with any 
colour a; see [la Theorem 1] and ^ Theorem 1]. (By ([2l71) . II'^'^ does not depend 
on (7.) 

Formally, 11 is a probability kernel from T to BT. So, for each P € ^(T), PII = 
/P(dro) n (To, • ) is the unique BRT for A with initial distribution P. Its projections 
to arbitrary windows W^ € P are given by 






PwHA 



(2.18) {PW)0 7r^ =r'w^^w 

for the restricted STIT kernel IIj^ {Tw,o > ■ ) from Tw to MTw with the restricted driving 
measure A( • n {W)). The abbreviation STIT stands for stability under the operation 
of iteration of tessellations. An explanation and further remarkable properties can be 
found in [E], [S], [E], [IS]-[23] and 



A generalisation of the STIT models, which still keeps the independence of the 
division process for distinct cells, are the cell-driven BRTs, which have division kernels 
of the form 

(2.19) ^{s,Ts,c,dH) = ip{c,H)A{dH) 

with a density function (/?(c, H) on C x H which vanishes except when H ^ (c). A special 
case are the shape-driven BRTs investigated in |20j : see also the examples therein. 

The next example demonstrates the flexibility of modelling in the present setting: It 
combines an interaction between the colours of the cells with a geometric homogenisation 
mechanism and an aging effect. The last feature takes advantage of the fact that division 
kernels may also depend on the past. 

Example 2.9. Contact-induced mutations with size balancing and aging. Let the colour 
space be S = { — 1,1} and consider a division kernel of the form 

$(s, T^, c, dH) = ip{c, Vj) Aiso(dn) dr /i(s, T^, c, do-+) ^(s, T^, c, da"), 

where H = {i],a'^ ,a~) with spatial part rj = {u,r) G S^~ x M and colours a^ G S. A 
special choice of the geometric prefactor is 

V?(c,77) =el(c>(r?)+e"^l(£^c>(^), 





Figure 2: Simulations of two BRTs with isotropic selection of lines, two colours and the full 
window as single initial cell. Left: A STIT tessellation; colours are chosen at random. Right: 
Colour mutation and size balancing as in Example 12.91 but without aging. Here, e ~ 0.05, 
/3(s) = (1 + s)/2 and a mixed boundary condition as indicated. 
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for some small e > 0; here, e-kc = m{c) + £{c — m{c)) is the e-retraction of c. This choice 
has the effect that the cutting hyperplane will typically pass close to the midpoint m{c) 
of c, so that its two daughter cells have comparable size. One can further choose the 
colouring rule 

fl{s,Ts,C, •) = (Jcol(c) +/3(as,c,Ts:Sc,Tj'5-col{c) > 

where a^^cT^ = s — mm{u G [0, s] : c € T^} is the age of c at time s, 
Sc,Ts= X] vold_i(cnc')/vold_i(5c) 

c'GT^: co1(c')=-co1(c) 

is the opposite-type surface fraction (measured by the Hausdorff measure of dimension 
d—1), and /3 : [0,1]^ — > (0, oo) is a suitable positive function. For instance, f3 can be 
taken to be decreasing in a so that increasing age reduces the willingness of splitting 
and mutating. One can further let /3 be increasing in s. Then, the larger a cell's surface 
fraction is in contact with cells of opposite type, the more the cell gets 'nervous' and 
hurries to divide, and the more likely it is that its daughter cells mutate to adapt their 
type to that of the neigbours. 

Our third example may seem somewhat exotic. It will be used in Remark 13.121 to 
demonstrate that a BRT on the full space M is not necessarily uniquely determined by 
its initial distribution and its division kernel. 

Example 2.10. Directional infinite-range interaction. This is an uncoloured model, 
for which S is a singleton. We further confine ourselves to the planar case d = 2. Let 
Ahor(d-ff) = 5(0,1) (dn)dr and Avert (d-ff) = 5(i,o)(d'u)dr be the measures on H = S^ x R 
for which all lines are horizontal resp. vertical. For any cell c G C let diamhor(c) = 
maxx^y^c \xi — yi\ and diamvcrt(c) = maxx^y^c \x2 — 2/2! be the horizontal and vertical 
diameters of c, where Xi stands for the ith coordinate of x. Also, let 

Chor = {c e C : diamhor(c) > diamvert(c)} 

be the set of all 'horizontal' cells. Finally, writing [n] for the centred square of area n^, 
let 

Phoi{T) = limsupn~^|{c G Tn Chor : m,(c) G [n]}| 

be the upper density of horizontal cells for a tessellation T G T, and define pvert{T) 
analogously. Then let 

Thor = {T G T : Phor(r) > /Overt (r)} 

be the set of tessellations with a dominating fraction of horizontal cells, and Tvert = 
IT \ Thor- Consider the division kernel 

(2.20) ^(S,T„C, •)=lTho,(T,)Ahor(- n(c))+lT_,(T,)Avert(- H (c)) . 

Since Thor is invariant under translations and tail-measurable, this <l> looks at the actual 
tessellation 'at infinity' in order to decide whether the cutting line should be horizontal 
or vertical. 

2.5 Conditional BRTs 

In this section we introduce a Gibbsian perspective on BRTs. As is standard in the 
theory of Gibbs measures, one aims at describing a macroscopic system by means of 
its local conditional distributions that describe the behaviour inside a bounded region 
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when the remaining system is fixed. Here we define such conditional distributions in the 
context of BRTs. Let W £¥ he a fixed window. 

2.5.1 Inner and outer projections. Recall from (|2.2p and ()2.8p that the projections 
TTw and TVw are defined by intersecting the cells with W, and thus wipes off much 
information on the cell geometry (such as, e.g., the location of midpoints). To avoid 
this, we introduce the 'inner' projection 

(2.21) vr|^ : T ^ Ti^, T ^ T^ := {c e T : c C int(VF)}, 

which removes all cells which are not completely contained in the interior of W^. It 
takes values in the set T^ of all possibly empty, not necessarily connected collections of 
cells inside W with pairwise disjoint interiors. The counting variables N^ in ()2.ip are 
even defined on T'^ and generate a cr-field B{T'^), for which ttJ^ is measurable. As the 
cells of Ty^ are even required to be contained in the interior of W, T^ is a measurable 
function of T]y. 

In the same way, we define the inner projection 

(2.22) Tri^ : BT 9 T ^ T'^ = (T^, Jo<s<i 

on BT, where T^^ = 7rJ^(Ts). Arguing as in Remark 12.41 one finds that Tj^ is uniquely 
determined by T^q, ^(T^r), and the finite set of all 'immigration events' {s,c) with 
T^g = T^g_ U {c}. Consequently, one can generate a cr-field on the range BTj^ of ^^^r 
by means of counting variables similar to those in (|2.9p . so that ttJ^ becomes measurable. 
Note also that tt^t = ttJ^ o 7v\y. 

Complementary to the above, we also introduce an 'outer' projection for W by 

(2.23) vr^* : T 9 T ^ T^* := T \ T^ = {c G T : c \ mt{W) / 0}, 
and a 'boundary ' projection 

(2.24) TT^ : T 3 T ^ Tr^\Tw) = {cHW : c e T, cH dW ^ H)}. 
Likewise, on the level of branching tessellations we define 

(2.25) TT^* : BT 9 T ^ T'^' = (^^Hr.))o<,<i , 

(2.26) TT^ : BT 9 T ^ T^ = K(r,))g<^<^ = 7r^*(Tvy) • 

In the forest picture of Figure [H each T;^* in the range BT,^* of ttj^* corresponds to 
a forest of binary trees from which all cells within W are erased. So, one can use the 
counting variables in (j2.9p to generate a a-field on BT^JJJ*, and '^'\^^ is then evidently 
measurable. The same applies to tt^^. Furthermore, to keep the full information on the 
initial tessellation in W^ resp. in W, it will also be convenient to introduce the mappings 

(2.27) TT^/^* : T^(r^,o,Tr), 

(2.28) tt"/ : T^(r^,o,T^)- 

For each of the projections tt^^^ in (12:22]) . ([225]), (Q2B, (I22ZD, and (l2:28|) . we write 
B^ = cr(7r^) for the cr-field on BT that is generated by this projection. By abuse of 
notation, we will use the same symbol 0^ for the cr-field on the range of tt^. 

2.5.2 Conditional BRTs. Let T € BT any branching tessellation. Consider the 
time-dependent 'inner' window 

(2.29) inwis, T^) := W \mt{[j{c : c G T^J) = U{c : c G T^^^J, 
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which is possibly empty and not necessarily connected. It is measurable jointly in both 
arguments, piecewise constant and right-continuous as a function of s. Let 

(2.30) < ti = h{Tfy) < . . . < t„ = t„(Ta,)(T^) < 1 

be the jump times of the path s i— )> int^(s,T^), which depend measurably on T^. 
(Note that possibly n(T^) = 0. For the sake of convenience, we also exclude the case 
that there is a jump at time 1, which occurs with probability zero.) At each ti, T^ 
creates a new cell Cj inside W, namely 

a = q(T^) := d{mw{ti,T^) \ mw{ti_i,T^)), 

where to = 0. In other words, T^ induces a process of immigration of cells into W. 

Definition 2.11. Let <l> be a division kernel and suppose that the following random 
process S = {Ss)o<s<i with S U T^* := {Ss U r^*)o<s<i G BT is well-defined: 

- Let Sro^t^) = {Ss)o<s<ti be the BRT in the window iniy(0, T^) with time interval 
[0,ii), initial tessellation 5*0 = T^q and division kernel 

$l^(s,S„c, •|T^*):=$(s,S,UT^,*„c, •) forced,, sG[0,h). 

- For i = 1, . . . , n and conditional on St,_ let Sj^^^^.^^) = {Ss)ti<s<ti+i be the BRT in 
the window iniy(tj,T^) = q U inty(ii-i,T^) with time interval [ij,ti+i), initial 
tessellation St^ = St^- U {q} and division kernel 

$l^(s, S„ c, • |T^') := $(s, S, U T^;„ c, • ) for c G Ss, s e [k,U+i). 

Here tn+i = 1, and we finally set Si := Si- . 

The distribution of S on BT|^ will be denoted by G^y( • |Tpy^" ) and is called the con- 
ditional BRT for ^inW with initial tessellation T^q and boundary condition T^y*. 

By construction, G^ is a probability kernel from (BT^^" ^^w^ ) ^° i^'^w^^w)- 

Example 2.12. Conditional STIT tessellations. As in Example 12.81 let A be a loc- 
ally finite measure on H and A* be the associated division kernel; cf. ()2.17p . Then, 
G^( • |T^°"*) := G^( • |T^°"*) is simply the distribution of 

n 

(i) 



U s(^)u|Js 



pT^in i=l 

for independent random STIT tessellations S^'^) and S^'^ for A. Here, S^'^^ evolves in time 
[0,1] from the single-cell tessellation Sq = {c} of the initial polytope sp(c), whereas 
S^*-* evolves in time [ti,l] from the single-cell initial tessellation Sl^ = {q} of the 'im- 
migrated' polytope sp(cj) and is extended to the full interval [0, 1] by setting Ss = 
for s € [0,tj). Since A* does not depend on the surrounding tessellation, it follows that 
the measure G^( • jT^^" ) depends only on T^ . 

Here is the natural counterpart of the concept of (macroscopic) Gibbs measures in 
our setup of branching random tessellations. 

Definition 2.13. Let <I> be any division kernel. A BRT P G ^ is called a Gibbsian 
BRT for $ if P^°"* O G|. = P for allW e¥ or, more exphcitly, if 

P(7ri;^ G A I ;B^°"*)(T) = G^{A\ T^°"*) for P-almost ah T 

for all AeB^ and W £¥. 
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2.6 Translation invar iance 

A main focus of this paper is on BRTs that are invariant under spatial translations. For 
each X € M , the translation -d^ by the vextor —x acts 

- on cells c € C via 'd^ '■ c^^ c — x := (sp(c) — x, col(c)), 

- on bicoloured hyperplanes i? G IHI via 

-d^ : H ^ H - X := (sp(i7) - x, col+(F), col" (F)), 

- on tessellations T G T via 

^^ -.T ^ T - X := {c - X : c eT}, 

- on branching tessellations T = {Ts)q<s<i G IBT via 

^^:T^T-x:=(r, -x)o<.<i. 

That is, only the spatial coordinates are shifted, but the colours remain unchanged. 
Moreover, by abuse of notation we use the same symbol 'd^ for the translation on each 
level, and we will also use it for the simultaneous translation of pairs of objects as above. 

Definition 2.14. A BRT P € <^ is called translation invariant if it is invariant under 
the action of the translation group G = {'&x)x£K.'^ o^ Wf, in that P o 'd~^ = P for all 
X £ M'^. We write ^e = =^0(1BT) for the set of all translation invariant BRTs that 
satisfy the first-moment condition (j2.1(jp . which by translation invariance is equivalent 
to the requirement that the 'hitting intensity' 

(2.31) ii(P):=y"p(dT)|T[i],i| 

is finite. Here, [1] := [—1/2, 1/2]"' stands for the centred unit cube. 

Translation invariance allows to investigate the behaviour of a random tessellation 
'around a typical cell', which for convenience is located 'around the origin'. This is 
formalised by means of Palm calculus as presented in [101 Chapter 12] and |17l The- 
orem 4.1.1]. Let P G ^e be given. Then, the Campbell measure of P on BT x C is 
defined by 



(2.32) C^= /"p(dT) J2' 



\t,c) 



It captures the joint distribution of the (terminal) cells and the complete history of their 
surrounding tessellation. The Palm calculus now states that there exists a finite measure 
P*^ on BT X Co, the so-called Palm measure of P, such that the Palm formula 

(2.33) / dC^(T, c) f{m{c),c - m(c), T - m{c)) = f dx f dP°(T, c) /(x, c, T) 

holds for any non-negative measurable function / on M x Co x T. Its normalised marginal 
on Co is called the typical cell distribution. 

Later on, we will often consider the integral over time s of the Campbell measure 
and the Palm measure of the projected BRTs P^, and it will be convenient to have a 
shorthand notation for these objects. So we define the extended Camplell measure 

(2.34) C^ = f ds I'fMTs) Y1 ^(^,t.,c) 

Jo J ^gy^ 
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and the extended Palm measure 

(2.35) P° = y dsy"dP°(T„c)5(,,T.,c). 

For W (z¥ we similarly define the extended local Campbell measure 

(2.36) C^""' = [ ds [ I>wA^T^w,s) Yl hs,T^,,cy 

Also, we will often use the time-integrated version of the Palm formula ()2.33|) . where 

the Campbell measure C^ and the Palm measure P*^ are replaced by their extended 

— p — 

relatives C and P , respectively. For example, combining the time-integrated Palm 

formula with the first-moment condition ()2.3ip we find that the total mass of P can be 
estimated by 

(2.37) ||P°|| = /"dP(s,T,) \{c G Ts : m{c) G [1]}| < ii(P) < oo 

We conclude this section with some comments on random, but not branching, tes- 
sellations P € ^(T). These can be considered as BRTs by identifying the space T 
with BTq. In particular, it is then clear what translation invariance means, and we can 
introduce the set 3^q{T) of all translation invariant random tessellations P that satisfy 
the first-moment condition 

(2.38) io{P):= f P{dT)\T[,]\<oo. 

Since io(P ° ttq" ) < ii(P), the initial distribution of each P € ^q satisfies (j2.38p . 

3 Results 

Our results use a STIT tessellation as a reference model. Therefore we fix a locally 
finite reference measure A on H which is invariant under translations. Moreover, we 
write n (To, • ) for the associated STIT kernel, as introduced ibidem. 

3.1 Division kernels for BRTs 

As we have seen in Remark 12.71 the concept of division kernels allows to introduce a 
convenient class of local BRTs. Our first goal here is a global counterpart. That is, we 
ask whether the evolution of a global BRT P G ^ can be described by a division kernel 
^. As we will see, this is the case whenever P is locally absolutely continuous with 
respect to the STIT model PII with initial distribution P = P o ttq" , in that 

(LAC) Pw < Pw'^w for allW eF; 

recall that Pw'^^ = (PH^) o tt "^ by (1238]) . 

We note in passing that (LAC) also implies that the realisations of P almost surely 
exhibit a 'tame' geometry. Namely, in the planar case, they show exactly one type of 
vertices, the so-called T-vertices, at which an endpoint of a line segment hits an inner 
point of another line segment (provided this holds already for the initial tessellation); 
see [13] [T5] and the references cited therein. 

To state the result we recall that the extended measure P and the extended pro- 
jections 7V\Y have been introduced in and after (|2.11|) . and that the operator 0s,c,H is 
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defined after (j2.15p . Further, we will need a projection that refers to the cell division 
procedure. Namely, for W €¥ we introduce the projection 

(3.1) ^w-{c,H)^{cnW,H) 

on C X H, which for each T € T maps the set 

^^^A{Tw) := {(c,F) :c£T,He{cnW)} 

onto A{T\y) := |(c, H) : c G T\y, H G (c)}. Finally, a real function on BT is called local 
if it is y^Hz-measurable for some H^ S P. 

Theorem 3.1. For each P S ^ satisfying (LAC) there exists a division kernel <1> for 
P such that the following holds. 

(a) For all non-negative measurable functions f on BT x C x H, 
/"p(dT) Yl f{s,Ts-,c,H) 

= J dPis,T,) I ^{s,Ts,d{c,H)) f{s,Ts,c,H) . 



(b) The (in general time-inhomogeneous) Markov process T = (T5)o<s<i in BT with 
distribution P satisfies the forward equation with generator 

Lfg(T,) = l^{s,Ts,d{c,H)) [g{0s,c,H{Ts))-9{Ts)] , 

at time < s < 1; here, g is any bounded local function on BT. 

(c) P is a Gibbsian BRT for <l>. 

(d) For every VK G P, the local projection P^i/ is a BRT in W for the localised division 
kernel ^W; which is defined as a regular version of the conditional measure 

where s G [0, 1] and B is any measurable subset of Cw x (W) ■ 

Definition 3.2. If P and <^ are such that the properties (a) to (d) above hold, we will 
say that P admits the (global) division kernel <&, or that ^ is a division kernel for P. 

Statement (a) characterises the 'jump intensity measure' in terms of $. It implies 
statements (b) and (c), which describe the temporal and the spatial aspect of P, re- 
spectively. So, (b) asserts that the global process with distribution P evolves in the very 
same way as the local processes in Remark 12. 7|, in that a cell c in environment T^ at time 
s is split by a bicoloured hyperplane H with instantaneous intensity ^(s, Tg, d(c, H))ds. 
On the other hand, by the definition of Gibbsian BRTs, statement (c) means that $ 
also describes the cell splitting mechanism within a local window when the evolution of 
all other cells is given. The final statement (d) addresses the (unconditioned) marginal 
process in a local window W and shows that its division kernel is obtained by the natural 
averaging over the possible environments outside W. (The converse question of whether 
a BRT for a given division kernel exists will be addressed in Theorem 13.101 ) 

We can also state that the division kernel for a translation invariant BRT can be 
chosen to be covariant. 
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Corollary 3.3. Each P € ^q satisfying (LAC) admits a division kernel $ which is 
covariant under translations, in the sense that 

(3.2) ^s,Ts,'&-' ■) = Hs,Ts-x,-) 

for all X G M and all (sjT^) G BT. Moreover, the local division kernels can be chosen 
to be covariant in the sense that 

(3.3) ^w{s, Tw,s, ^^"^ • ) = ^w-x{s, Tw,s - x, ■) 
for all X G W^ , {s, Tvy,^) G Wfw and TV G P. 

3.2 The inner entropy density 

We now turn to a 'thermodynamic' investigation of translation invariant BRTs. Our goal 
here is an appropriate notion of entropy. Recall that the relative entropy, or Kullback- 
Leibler divergence, between two probability measures /i and z^ on a common measurable 
space is defined to be 'H{^;v) = J log/ d// if // <C z^ with Radon-Nikodym density /, 
and +00 otherwise. It can also be written in the form 



(3.4) n{fi;u) = J 0{f)du, 

where g is the non-negative convex function 

(3.5) g: a\-^l — a + a log a. 

The formula ()3.4p readily shows that ?^(/i;z^) > with equality precisely when ^ = v. 
We can also take it as the definition of relative entropy in the more general case when 
/i and V are finite, not necessarily normalised measures. 

Further, if ^ is a sub-u-field of the underlying u-field then the conditional relative 
entropy given A is defined as 

(3.6) 'H(/_f; p\A) = I ■H(/U^( • \x);vj^{ ■ \x)) fi{dx) , 

where //y^( • \x) and z^y^( • |x) are conditional measure kernels given A for // and v, re- 
spectively (provided such kernels exist). 

In our setup, we take the STIT model for A as our reference measure and introduce 
an 'inner' entropy as follows. Recall the definition ()2.28p of tt^^ and its associated 
(T-field By^ = o"(7ry^ ), and Example 12.121 for the definition of the kernel G^. 

Definition 3.4. Let P G ^ be a BRT and VF G P. The inner entropy ofP in W is 
then defined by 

(3.7) n^iP) := n(Pw; Pw,oK\b'w^) = niPw; f%^® g^) ■ 

(According to physical convention we should add a minus sign, but here we prefer to 
ignore this convention.) 

So, the attribute 'inner' means that this entropy compares the evolution of P with 
that of the STIT model only for those cells that are completely contained in W, while 
the evolution of all other cells hitting W is ignored. In the setup of Gibbs measures on 
general graphs, an analogous concept of inner entropy has been considered by FoUmer 
and Snell M. 
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Next, let [n] := [— n/2,n/2]'^ denote the closed centred cube of volume n'^. For a 
translation invariant BRT P S ^e^ one expects that the limiting inner entropy per unit 
volume 



lim n-^n]l^{P) 



exists, which is then called the inner entropy density of P (relative to the reference 
STIT cutting rule A). Indeed, our result is the following; see ()2.35p for the definition of 



the extended Palm measure P . 



Theorem 3.5. For each P € ^q there exists the possibly infinite limit 

/.-(P):= limn-'^nUP). 



If this limit is finite, P admits a translation covariant division kernel $, and 
(3.8) 



/ii'i(P)=7^(P°0$;P°0A*) 



ydP°(s,T„c) n{^{s,Ts,c, •);l(c>A) . 



So, the inner entropy density /i'"(P) is the conditional relative entropy of its division 
kernel $ with respect to A* when the branching tessellation and its cell are selected 
according to the extended Palm measure P . In particular, if /i™(P) is finite then the 
division kernel $ of P admits a Radon-Nikodym density with respect to A* . 

It is natural to expect that the relative entropy density is affine and lower semi- 
continuous with compact level sets, at least under some natural caveats. We show this 
for a topology that is finer than the common weak topology, but is not metrisable. 
Namely, we define the topology rioc of local convergence on 3^ as the coarsest topology 
for which the mapping P i— )> //dP is continuous for every bounded local function /. 
It is then clear that i^e is closed in ^ . Recalling the definition ()2.3ip of the hitting 
intensity «i(P), we can then state the following. 

Theorem 3.6. The inner entropy density /i™ is affine and lower semi- continuous in 
Tioc- Moreover, for any two constants < /3,7 < oo and every P € ^@{T), the restricted 
level set 

^e,P,/3,7 := {P e ^e : P o tt^-^ = P, ii(P) < /3, h'^'iP) < 7} 
is compact and sequentially compact in tiqc- 

3.3 Variational principle and existence 

Here we change our perspective: Rather than describing a given BRT in terms of its 
division kernel $, we will now suppose that a 'nice' division kernel \I' is given in advance. 
As we will see, ^ gives rise to an 'inner energy' functional on ^s, and further to an 
associated 'inner free energy', which in turn leads to a variational principle and an 
existence proof for BRTs with division kernel ^. Here are the conditions on ^ we need. 

Definition 3.7. Let us call a division kernel ^ moderate if there exists a measurable 
density function ^ on the set 

{{s,Ts,c,H) :0<s<l, T, gBT,, c G T,, H e (c)} 

satisfying 

^(s, T„ c, dH) = V(s, T„ c, H) 1^,) (H) A{dH) 

such that the following holds for all arguments: 
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(Ml) ip is covariant under translations, in that ip{s, Tg, c, H) = 'ip{s, Tg—x, c—x, H—x) 
for all xeW^. 

(M2) ^ has hounded range, meaning that there exists a constant < r = r\j, < oo 
such that ■i/'(s, T^, c, H) = ip{s, T^, c, H) whenever Tc+Br,s = "^'c+Br s- Here, Br 
stands for the closed centred ball with radius r. 

(M3) ip is bounded and bounded away from zero; that is, there exists a constant Kiir < oo 
such that I log -01 < i^'S'- 

(M4) ^ is approximately STIT for large cells, which is to say that there exists a con- 



stant K^ < oo such that 



/ AidH)\ij{s,Ts,c,H)-l\ <4. 

J(c) 



'(c> 

(In view of the boundedness assumption (M3), this condition involves only the 
cells c for which A((c)) is large.) 

The assumptions (Ml) to (M4) are trivially satisfied if ^ = A* is a STIT kernel as in 
Example 12.81 More generally, assumptions (M1)-(M3) hold for the cell-driven division 
kernels in ()2.19p whenever the density ip there is uniformly bounded from above and 
away from zero; (M4) can be achieved by setting ^ = A* for cells with large radius. In 
Example 12. 9t (Ml) trivially holds, (M2) holds for each r > 0, and (M3) follows from 
the assumptions on /3 stated there. Example 12.101 violates the bounded-range property 
(M2) in the most extreme way conceivable. 

A moderate division kernel induces a functional on ^e which, in analogy to the 
standard Gibbs theory, may be called the (negative) inner energy in W for ^, and is 
defined by 

(3.9) U^{P;^)= fp{dT) Yl log^P{s,T,_,c,H)■, 

is,c,H)€&{T):cCW 

P £ r0^e. (Note that in statistical mechanics the energy always appears negatively in 
the exponent; so it should not be surprising that logV' shows up here. But we suppress 
the minus sign.) Likewise, there is a term which comes from a normalisation (in our 
case of the distribution of jump times), and thus may be considered as an analog of 
the pressure in statistical mechanics. In the present setup, however, this quantity is not 
only a functional of ^, but also of the BRTs P € ^e, viz. 

(3.10) V^(P;^)= [dP{s,Ts) Y. f MdH){ij{s,T„c,H)-l). 



c&Ts-.cdW 



Theorem 3.8. For every moderate division kernel ^ and every P € i^e admitting a 
covariant division kernel $, the following finite limits exist and can be identified: 

^in/p.^N := lim n-'^ZY?^,(PM;^) = flog V' dP^0 $ , 
^in/p.^) := lim n-'^V?^(Pr„i;*) =/'('(/'- 1) dP°0 A* . 

n— >-oo l"J L J 

In particular, |n'°(P;^)| < K*ii(P) and |t;''^(P;^)| < K'^ii(P). 

The energy terms above can be combined with the inner entropy density to define 
the inner excess free energy density o/P for ^, namely 

(3.11) h'^'iP; ^) := /i''^(P) - u'^'iP; ^) + v'^'iP; ^), 
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where the right-hand side is set equal to +00 if /i™(P) = +00. In fact, in the finite case 
it will turn out that 

(3.12) h'\P;^)= ldP^{s,Ts,c) nms,Ts,c, •);^(s,T„c, • )) , 

where $ is a division kernel for P. The following variational principle for BRTs is then 
immediate. 

Theorem 3.9. Let ^ be any moderate division kernel. A BRT P G ^q then admits ^ 
as its division kernel if and only i//i™(P; ^) = 0. 

In particular, this can be used to prove the following result. 

Theorem 3.10. For any moderate division kernel ^ and every P S ^q{T), there exists 
a translation invariant BRT P G ^e with initial distribution P and division kernel ^. 

There is a large variety of initial random tessellations P to which this existence 
theorem applies. The most common examples are the Poisson-Voronoi tessellation, the 
Poisson-Delaunay tessellation, and the Poisson hyperplane tessellation, which are well- 
known to satisfy the moment condition (j2.38p . Further examples are the Delaunay 
tessellations that are constructed from tempered Gibbsian point processes with tile 
interaction, as studied in [6]. Unfortunately, we cannot allow a start in a degenerate 
tessellation with the full space M as its only cell (of any colour), which would be of 
major interest; cf. the discussion in Example 12.81 on the STIT measure 11 '°°. However 
and as already indicated above, we can choose the initial distribution P to be the time-5 
distribution of 11'°° for some small 6 > 0. Up to a time shift, this means that there 
exists a BRT with degenerate start for any moderate division kernel ^ with an initial 
cutoff of the form iIj{s, •,•,•) = ! for < s < 5 and some small 6. In the special case 
of shape-driven tessellations as in (|2.19|) . the existence of a BRT with degenerate initial 
tessellation has been proved in [20] under regularity assumptions. 

Since /i™( • ; ^) is affine, the last two theorems imply the following. 

Corollary 3.11. For any moderate division kernel ^, the convex set^Q{^) of all trans- 
lation invariant BRTs admitting ^ is a face of ^q . That is, the extremal elements of 
Sfe(^) '^'^e in fact extremal in ^e, and thereby ergodic under translations. 

It is clear that for each ergodic P G ^e(^) its initial distribution P = PotTq" is also 
ergodic. The converse holds whenever the correspondence between an initial distribution 
P G ^e(T) and its associated P G ^q{^) is one-to-one. This, however, does not hold 
in general, as our concluding remark shows. 

Remark 3.12. Uniqueness and phase transition. It is natural to ask whether or not 
the convex set ^(P, <I>) of all BRTs with initial distribution P G ^(T) and division 
kernel <I> is a singleton. In general, this is not the case. To provide an example, let 
d = 2 and consider the division kernel $ defined in Equation (|2.20p of Example 12.101 
Let Tj-eg = {[1] + i : i G Z^} be the regular tessellation of M^ into unit squares and 
P G ^e(T) be given by P = /r^j dx ^T.eg-x- Further, let P^°'' be the STIT tessellation 
with initial distribution P and driving measure Ahoj. as introduced in Example l2.10l and 
define p™''* analogously. It is then clear that these BRTs live on the spaces Thor resp. 
Tvcrt for all positive times. As a consequence, P^^™ and P™''* are two distinct BRTs 
which both belong to $^e(^) and have the same initial distribution P. 

Although the infinite-range interaction of this example is somewhat artificial, we 
learn that uniqueness does not hold automatically. Instead, the phenomenon of non- 
uniqueness, or phase transition, which is a central issue of statistical mechanics, shows 
up also in the present setting. In analogy to standard results on Gibbs measures (cf. 
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[H Section 8.3]), we will show in Proposition 14.171 below that uniqueness does hold for 
suitable division kernels of bounded range in one spatial dimension. Uniqueness is also 
known in the non-interacting case ()2.19p when the initial tessellation is degenerate and 
the density (/? exhibits some regularity properties |20j . We leave it to the future to find 
sufficient conditions for uniqueness in higher dimensions as well as examples of bounded- 
range division kernels exhibiting phase transition. In fact, Figure [2] (right) suggests that 
a phase transition might already occur for the (moderate) model of Example 12.91 

4 Proofs 

4.1 Some properties of local BRTs 

Before entering into the proofs of our results we will establish some auxiliary properties 
of local BRTs. First we will express the local evolution of a BRT in a more explicit 
form. Throughout this section we let VF € Py be an arbitrary window. For any division 
kernel $vy in W we introduce the abbreviation 

(4.1) 4>w{a,b;T)= I 0H^(s,T,)ds, 

J a 

where 0<a<fe<l, Tg MTw and (f^wis, Tg) is as in Remark 12.71 For every Tq G Ty/ 
we define a measure on WS\\r by 

(4.2) «>o {o<s-;<...<;„<i} ^=1-^ 

X exp [- (^H/(0,1;T))] l{7ro(T)=To,5^(T)={(si,Ci,//i):l<J<n}} '^T , 

where sq := and the last indicator function simply means that T is is the unique 
branching tessellation which starts from Tq and is successively defined by the division 
events {si,Ci,Hi); recall ()2.5p . 



Lemma 4.1. Let ^w be a division kernel and Pw S J^{MT\y) a BRT in W. Then the 
following statements are equivalent. 

(a) I*w admits the division kernel ^w- 

(b) Hy^ (Tq, ■) is the conditional distribution ofPw given 7ro(T) = Tq. 

(c) For every non-negative measurable function f on MTw x Cvk x (W), 

fPwidT) Yl fis,Ts-,c,H) 

(4^3) {s,c,H)&Sl{T) 

= j ds j PwA^T^s) J ^w{s,Ts,d{c,H)) f{s,T,,c,H). 

(d) PvF has the infinitesimal generators L,^ of (j2.15p . in that the forward equation 

(4.4) I g dPw,t - j g dPwfi = j "^^ j dPiy,« ^w^sd 

holds for all t G [0, 1] and bounded measurable functions g on BTyi/. 
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Proof, (a) implies (b). Recall the recursion steps from Remark 12.71 For i = 0, Tq = Tq 
is chosen according to the distribution Pi^^o- For each i > 1, conditionally on the first 
(i— 1) division events, the zth division event {si,Ci,Hi) for a random tessellation T with 
division rule ^w is chosen according to the distribution 

exp [ - ^wisi-i,Si;T)] dsj ^w{si,Ts^_-^,d{ci, Hi)) ; 

here we have used that T^ = T^^^ for s € [sj_i,Si). So, on the event {|^(T)|}, the 
joint distribution of the elements of ^(T) is the product of these conditional measures 
for i = 1, . . . , n, times the probability that s„ is the last division time before 1, which is 
exp [-<^w'(s„,l;T)]. 

(b) implies (c). Fix any initial tessellation Tq. On the set of all T with fixed 
number |^(T)| > 1 of division events, the measure Hyy^ {Tq, ■ ) has a product structure. 
The elements of ^(T) can be labeled with i € {1, . . . , n} according to their temporal 
order. For each i we extract the ith term from the product, omit its index i, and 
separate the terms concerning the division events before and after time s = Sj. That 
is, we write ^(T) = &' L) {{s,c,H)} U &" and separate the respective conditional 
measures. By Fubini's theorem, s can be considered as fixed. For given s, & has the 
same distribution as ^(T^), but we still have the condition that the extracted division 
event {s,c,H) has rank i in ^(T). This condition disappears by summing over i and 
n > i. Finally, the division events in &" can be integrated out because these do not 
enter into f{s,Ts-,c,H), and an integration over Tq gives (c). 

(c) im,plies (d). Applying Equation (|4.3|) to the function 

fis, T, c, H) = l[o,t] (s) [gi0s,c,HC^s)) - giTs)] , 
we find that for each t € [0, 1] 

f ds fdPw,s C!^5 = fPwidTw) J2 [5(Tm/,s) - 9{Tw,s-)] 

= JPwidTw) [giTw,t) - 9iTw,o)] = f 9 dP w,t - f 9dPw,o- 



(d) implies (a). In principle, this follows from Reference [7] which, however, makes 
use of a time-continuity condition on ^w- We thus indicate a direct argument. For 
brevity we omit most indices referring to W . Let < s < t < 1 and PjiTs be a regular 
version of the conditional probability oiPw.t given 7rvi/,s = Tg. Using ()4.4p for a function 
g of the form g = 1a1_b with A G Bw,s and i? € <t(T i-^ (Tvi/,n)s<n<i) and varying A, 
one readily finds that 



(4.5) Pt|T. {B) - 5t, (B) = l^uj dP,|T, L*l 



for almost all Ts. We now fix Ts and think of each T„ as an element of BT^y which 
is constant on [u, 1]. Also, for T S MTw we let r(T) be the time of the first jump of 
T after time s, which is set equal to oo when there is no jump during [s, 1]. Setting 
B = {T e MTw ■■ t{T) > 1}, we then find from (gSD that 

Pt|T, {T>t) = l- j du 4>iu, T,) P,|T, (r > u) 

and therefore P(|Ts(''" > ^) = exp[— (/>(s, t; T^)]. In other words, r has the condtional 
distribution used in Remark 12. 7[ 

Next, let F C C X M be measurable and 0r(r^) = {(Z)c,h{Ts) : c & T^, {c,H) G F}. 
Consider the set B = {T e MTw ■ Ti G (Z)r{Ts)} and let r2(T) the time of the second 
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jump of T after s (which again is set equal to oo if no second jump exists). Then 
1b(T„) = l{T<«<T2,T^G0r(T^)}('^) fo^ u> s, and (03]) imphes that 

Pi|T,(T<t<r2, r, G0r(r,)) = f du$(n,T„r)P,|T,(r>u) 

J s 

- [ Pt|T. {{r, Tr) e d(^, T,)) / du 4>{u, T,) P,|T, {t>u). 

J(s,t\x(Z)r(T.,) Jv 



Using the exphcit conditional distribution of r derived above, we thus find that the first 
term on the right-hand side of the above equation is equal to 

/ du^u,Ts,r) exp[-4>{s,u;Ts)], 
whereas the second term equals 

/ Pi|T, ((r, Tr) G d(7;, T„)) P^it, (v < t < t) 

J{s,t]x(Z,r(Ts) 

= Pt|T.(^2<t, T, e0r(r.)). 
We thus arrive at the equation 

Pf|T, {r <t,Tre 0r(r,)) = f du exp[-^{s, u; T,)] ^{u, T„ T) , 

J s 

which assures that (r, T,-) has the correct conditional distribution of Remark [221 □ 



Note that the joint integrating measure on the right-hand side of ()4.3p can be written 
in the concise form Pw(E>^w or, equivalently, C ^(8)$, where Pw = Po^f^ , P is given 
by ()2.1ip and C by ()2.36p . We will switch between both representations according 
to convenience. 

Corollary 4.2. Let Pw,Qw € ^{MTw) be two BRTs in W. Suppose Qw admits 
a division kernel ^Wj and Pw *C Qw- Then there exists a measurable function 
ipwis,Ts,c,H) > such that the measure kernel 

^w{s, T„ c, d^) := ipw{s, T„ c, H) ^^/(s, T„ c, dH) 

is a division kernel for Pw- 

Proof. For brevity we introduce the measure kernel 

is,c,H)eS>{T) 

for T G BTvy. The integration on the left-hand side of (|4.3p is then with respect to the 
measure P^yDvi/. Since Pw <^ Qw by assumption, it follows that Pvi/Dvy ^ QvyD^y 
with a Radon-Nikodym density /, say. It also follows that Pw ^ Qw with a density g. 
Define 

<fw{s, Ts, c, H) = f{s, Ts, c, H)/g{s, T,) 

if the denominator is positive, and zero otherwise. Then we obtain, using equation (|4.3p 
for (Qvi/,^vy) in place of {Pw,^w), 

Pw'Dw = fiQw'Dw) = f(Qw ^) = Pm/ O i^w'^w)- 
In view of Lemma HTH this means that Pw admits the division kernel ^w '-= fw'^w- D 
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Finally we look at the first-moment condition (j2.10p . 

Lemma 4.3. Let <I>vy be a division kernel for W , and suppose its total mass satisfies 
the uniform bound (j2.14p . Then 



Jn^{To,dT)\n\<e'''\To\ 



for all initial tessellations Tq G Tw- Moreover, for every e > one can find a number 
T < oo such that 

lup^iTo,dT){\T,\-T\To\)^<e\To\ 

for all Tq G Tvk- 

Proof. Recall the description of 11 ^^'(To, • ) in Remark 12. 71 The algorithm there implies 
that, for each i with Sj < 1, the holding time Sj — Sj_i dominates an exponential time with 
parameter |rs^_J(/), independently of the previous recursion steps. Hence, the process 
\Ts\ is stochastically dominated by the Furry- Yule process Zg S N with birth rate (p, 
namely the pure birth Markov process which starts in k = \Tq\ and jumps from any 
J > 1 to j+1 with rate j(j). Equivalently, Zs can be described as the branching process 
in which each individual, independently of all others, lives for an exponential time with 
parameter (p and then splits into two offspring. In particular, the descendance trees of 
each of the k ancestors are independent, and it is sufficient to look at the number of 
descendants at time s in each of these trees. This number is known to have the geometric 
distribution with mean e'^'^. A proof of this can be found e.g. in [1^, Examples 6.4, 6.8 
or Exercise 6.11. 

As for the second assertion, we conclude from the convexity of the function a i— >• 
(a — r)+ that 

(|Ti|-r|ro|)^< J](|r,,i|-r)+. 
cero 

Here, \Tc^i\ is the number of descendants of the initial cell c at time 1, which is stochastic- 
ally dominated by the geometric random variable Z^. As E(Zi — r)_|_ — > as r — )• cxd, 
the result follows immediately. D 

4.2 Prom local to global division kernels 

Let P G ^ be given and suppose (LAC) holds. Corollary 14.21 then implies that for 
each VF G P there exists a cell division kernel ^w in W such that Pw is a BRT for 
$VK- (In fact, $vy is absolutely continuous with respect to A*, but we do not need this 
here.) Therefore, our first step in the proof of Theorem 13.11 will be to construct a global 
common extension $ of these kernels ^w- Then, in a second step later, we will verify 
the properties of this $. 

Proof of Theorem \y.l\ first part: Construction of a global division kernel. In the situation 
above, the measure Pw ® ^w is concentrated on BTvk x ^Wi where 

ITw^ = {{s,Tw,s) : s e [0, 1], Tw,s G »Tw,s} 

and Aw = {{c,H) : c G Cw, H G (c)}. Take any event A in BT^y x A^y and recall 
(I3TD. Since 



^{Tw) = {{s,cnW,H) ■ {s,c,H) G ^(T), {c,H) G ^^'Aw}, 
Lemma |4. li e) then tells us that 

(4.6) Pw®^wiA) = Jp{dT) Yl M^w^^w)-'Ais,Ts-,c,H). 
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As a consequence, the measures Pw ® ^w with Pl^ G P are consistent in the sense that 

(4.7) (Pw'^^W')°(j^w(^T^w)~'^ = Pw^^w iorWcW'eF. 

To define a common extension of these measures we first localise to a fixed window 
V &F. For W D V we write Fw '^ ^v^w for the restriction of Pw ® ^w to the set 
MT\Y X TTy Ay- These measures have a finite total mass that does not depend on W. 
Indeed, (|4.6|) and the first-moment condition (|2.1U|) imply that 

\\Pw ® tv^wW = f P(dT) |{(s, c, H) G ^(T) : (c, H) G if ^Ay}| 

(4.8) •' 



< fp{dT)\Tv,i\ <oo. 



Since all spaces under consideration are Borel spaces, we can thus apply an abstract 
version of the Kolmogorov extension theorem |1H Corollary 6.15] to obtain a finite 
measure on BT x iiy Ay, to be denoted by P (8> ly^, which satisfies 

(P (g) lyl>) o (Ww ^ T^w) ^^ = '^W ® '^V^W 

for all M^ G P with W D V. Since V is arbitrary and 

IJ ny^Av = A := {(c, H)eCxM:H£{c)}, 

the measures P®ly$ can be glued together to a locally finite measure P(g)cE> on BT x A 
satisfying 

(4.9) (P ® 6) o {Ww (8) nw) "^ = Piy (8) ^w for all W gF. 

As we have indicated by the notation, disintegration shows that this measure is indeed 
the product of P with a locally finite measure kernel $. 

We must show that this ^ is indeed a (cumulative) division kernel. By construction, 
each $(s, Tg, • ) is supported on A, which means that each $(s, T^, c, • ) is supported on 
(c). In fact, considering the set A(rs) = {{c, H) : c G T^, H G (c)} and its complement 
—^A{Ts), we can write 

fdP{s,Ts)^{s,Ts,^A{Ts)) 

= [d(P0^){s,Ts,c,H) lim (l-l~-i^,^ .(c,i/)), 

and the last term vanishes by ()4.9p and Fatou's lemma. So we can conclude that, for 
P-almost all (s,Ts), ^(s,Ts, •) is indeed supported on A(rs), as required. D 

It remains to prove that the kernel $ satisfying (|4.9p satisfies the properties (a) to 
(d) of Theorem 13.11 Observe first that statement (d) follows directly from (|4.9p . As for 
(a), a combination of (j4.6p and ()4.9p shows that the asserted identity holds when there 
is some VF G P and a measurable function /^y with 

/(s, T, c, if) = l(,nH/>(i^) /h'(s, Tm^, c n T^, i7). 

The general identity thus follows by a monotone class argument. Furthermore, the 
proof of (b) can be literally taken from the proof of the implication '(c) implies (d)' of 
Lemma 14.11 

The proof of the Gibbs property (c) requires some more work. Let us first introduce 
a modification of the outer projection for a given window M^ G P, which refers to a 
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larger window W' G P rather than the full space M'^. Namely, for W C W G P and any 
Tw G ^Tw let 

(4.10) T|^''°"* = ({c G Th^,,. : c ^ int{W)}) ^^^^^ 

be the evolution of the cells hitting VF'\int(VF). In particular, if W^' = W then T^y'°" = 
T^. We also set t{;|^''°'°"* = (T^^o, t|^''°"*) and let iSjiJ^''"'""* denote the a-field on BTvy 
generated by the mapping T\y' >-^ T^ ' '°" . 

Proof of the Gibbs property, Theorem \3.lV c). Fix a window 1^ G P and let ^^r{ ■ |Ty^" ) 
be a regular version of the conditional distribution of tt^^ under the condition t^t^^ = 
T^" • We need to show that this probability kernel coincides with G^(- |Tp^" ) al- 
most surely. (In particular, this will imply that the latter is almost surely well-defined.) 
Pick any two non-negative measurable functions g{T^" ) and h{s^T^^ ^,c,H) of the 

indicated arguments. We suppose g is local, in that ^(T,;^" ) = giT^y^r ' '°" ) for some 
W G P containing W. Consider the integral 

(4.11) /"p(dT)5(T°^°^*) /'pi;^(dS|T°^°"*) Yl Ks,Ss-,c,H). 
By the definition of conditional distribution, this is equal to 



/ 



P(dT) g{T%"^') Yl ^(^' T^w,s-,c, H) . 



In view of the locality assumption on g, the integrand actually only depends on Tw'^ 
which is a pure jump process and therefore strongly Markov. Writing (sj, Ci,Hi) for the 
ith division event of Tw' ™ temporal order, we can rewrite the last expression in the 
form 

(4.12) j; /"p^,(dT^Off(TS;''°'°"*)l{.,<i,,,ciy}(TiyO/i(^.,Ti^,s,-,c.,i/.). 

j>i -^ 

Now, both h and the indicator function in the integrand are measurable with respect to 
the cr-field Bw',Si of all events A G B\y' with Ar\{si <t}£ 13\Y',t for all t. By the strong 
Markov property, we can therefore replace the function giT^/ ' '°^ ) by its conditional 
expectation g{si,Tyy'si) relative to 13\Y',Si- Furthermore, the process Tjy ' '°" is itself 
a Markov jump process. (In fact, it can be considered as the BRT in W' for the division 
kernel which equals $vK'('S,Tvv",c, • ) if c ^ 14^ and is identically zero otherwise.) This 
means that 

g{si,Tw',s,) = sCsj.T^,;/""*) = g{si,Tw',s,~) 

when Cj C W. Altogether, we find that the expression (|4.12|) is equal to 
/'p(dT) Y. 9{s,Tw',s-)his,T^^,_,c,H). 

{s,c,H)eS'{T):cCW 

By statement (a), this in turn coincides with 

f ds f P.(dT,) Y I ^(^' Ts' ^' d^) 9{s, T^W',s) h{s, r^,„ c, H), 
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which by the Markov property is equal to 



J dsJp{dT)g{T%°''') Y, J^s,T,,c,dH)h{s,T^^„c,H). 



,0.out 



Taking conditional expectation with respect to T^ and using the conditional division 
kernel $(^( • |T,5J^*) from Definition I2.1H we can rewrite this as 

J P(dT) giT%"^') £ ds J P'^^,{dS,\T%"^') 

X U'^{s,Ss,c,dH\T'^') h{s,Ss,c,H). 



Since the underlying spaces are Borel, a comparison of ()4.1ip with the preceding ex- 

-lOjOut 
■W ' 



pression shows that, for almost all T,'°" 



is,c,H)eS>(S) 

which corresponds to ()4.3p . Lemma l4.ll therefore implies that Ply(- |T^y" ) coincides 
with G^y( • |T^y " ). This completes the proof of the Gibbs property (c). D 

Here are two supplements to the preceding proofs. The first deals with the consist- 
ency properties ()4.7p resp. ()4.9p . and the second with a localised version of the Gibbs 
property (c). 

Remark 4.4. Consistency of kernel densities. Consider two windows W, W € P with 
W C W and a BRT P G ^ satisfying Pw' < 'Pw'fi'^w'- ^^^ '^w and ipw' be 
the A-densities of the division kernels ^w and ^w of Pvi^ and Pvy, which exist by 
Corollary 14.21 The consistency equation ()4.7p then means that 

for P\Y (g) Ay[^ almost all arguments. Here, Pw',s\Tws stands for a regular version of 
the conditional distribution of T^i^g under P^, given Tw,s, and c' = vr^j^ (c, Tiyi^g) is the 
unique element of Tw^s with c' nW = c. An analogous statement holds for W' = M 
when P admits a global division kernel with a A-density. 

Remark 4.5. Localising the outer conditions for conditional BRTs. Fix two windows 
W,W' eW with W C W and let Pw' be a BRT in W' for a division kernel ^w'- 
Furthermore, replace M by W' in Definition [27TT] and use the conditional division kernel 

$i;^(s, s„ c, • iTiyo := ^w^'(s, s, u Tj:;;;°"\ c, • ) , 

to obtain a conditional BRT G^"( ■ |T,y ' '°" ) in W; here we use the notation intro- 
duced in and after (j4.10p . The arguments in the proof of Theorem IS.lT c) then show 
that the kernel G^' is a regular version of the conditional distribution of vr'^y for Pw' 
under the condition By^^ ' '°" • 

Next we show that, for translation invariant P, $ can be chosen to be covariant 
under translations. 
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Proof of Corollary \3.3[ We only consider the covariance of the global division kernel. 
The second statement of the corollary is then obvious from Theorem l3.1l fd). Let P € ,^0 
be as stated and ^ a global division kernel for P, which exists by Theorem 13.11 For 
j; G M let i}x be the spatial translation by —x, which acts on BT via "dx '■ {s, T^) 1— )• 
(s, T^ — x) and, by hypothesis, leaves P invariant. As before, we use the same symbol 
for the translation 'dx : (c, H) \-^ {c — x,H — x) acting on C x H. For x G M let 
$^(s, T^, • ) := d(s, T^ + X, 1? "^ • ). We first claim that 

(4.13) P«)$ = P«)^'' 

for all X. Indeed, let A € B{MT) and B € B{C x H) be arbitrarily given. Then we can 
write, using the i^a^-invariance of P in the first step, 

(P «) l>^')(^ xB) = (P^^){^-^A X i^-^B) 

P(dT) Yl 1a(s,T,_) 

(s,c,_ff)e^(T): {c-x,H-x)&B 
(s,c,H)£&(T):{c,H)<^B 

= (P«)i)(ylx5), 

proving (|4.13|) . The second and the fourth step come from Theorem 13.1( a). and in the 
third step we observed that ^(T) consists of the shifted elements of ^(T — x) and then 
used again the translation invariance of P. Equation (|4.13|) shows that $^'( • , ■ ,B) = 
$( • , • , -B) P-almost surely for each B. 

To obtain an everywhere covariant version of <&, we pick a countable generator Q of 
B{C x H) which is stable under intersections. We also let F be the set of all (s, T^) € BT 
which are such that l>(s,T^,S) = $?'(s,Ts, B) for all S G ^ (and thus all 5 G i3(CxEI)) 
and the countably many lattice elements y € Z . First, ()4.13p implies that P(r) = 1. 
As an intermediate step, we then define 

r l'(s,T„.) if(s,T,)Gr, 
$(s,T^, ■) = { . 

[ A*(s,Ts, •) otherwise, 

where A* is the cumulative STIT kernel of Example 12.81 It is then clear that $ is a 
version of <& which satisfies ^^ = ^ for all y € Z . 

To achieve the covariance under the full translation group, we finally define 



$ = / ^"^ dx , 

J[i] 

where [1] is the centred unit cube in W^. Then for each y € M'^ we have 



^y = $^ dx = $ 

because [1]+?/ can be decomposed into finitely many pieces which are lattice translations 
of corresponding pieces of [1]. On the other hand, since 



P(g)$=/ P(g)$^dx=/ P(g)$^dx = P(g)$ 

J[i\ J[i] 

by ()4.13|) . ^ is also a version of $. D 



28 



4.3 On the inner entropy density 

We first recall some standard properties of relative entropy. A basic fact is the variational 
formula, which states that 

(4.14) n{fi, v) = sup gdi^i- log e^d 

for any two probability measures /U, z^ on a common measurable space. Here, the su- 
premum extends over all bounded measurable functions on this space; see |26l The- 
orem 4.1]. On the one hand, the variational formula implies the useful estimate 

(4.15) I gdii<n{n,u) + log I e^du 

for any non-negative measurable g. On the other hand, using Jensen's inequality it 
follows immediately that 'H(^, v) is jointly measure convex in both arguments simultan- 
eously. Also, it is jointly lower semicontinuous in (/i, v) in the topology generated by 
the integrals of bounded measurable functions. Finally, if /i and u are restricted to a 
sub-u-field A then relative entropy is increasing in A. Alternative proofs of these facts 
can be found in [9], Section 15.1, for example. Since 'H{a^; bu) = a7i{n, v) + bg{a/b) for 
a,b > and normalised /i, i/ (recall ()3.5p ). the last facts extend directly to the case of 
finite measures, except that the convexity then holds in the first argument only. 

Now, turning to the proof of Theorem 13.51 we proceed with a series of lemmas. Let 
P € ^e be arbitrarily given and P = P o tTq" its initial distribution. We can clearly 
assume that 



(4.16) lim inf n"*^ n^ (P) < oo 

because otherwise there is nothing to show. 

Lemma 4.6. Condition (|4.16|) implies condition (LAC). 

Proof. We fix a window M^ G P. For any n with W C [n] let 

(4.17) ^„ = {T e BT : en VF = for aU c e To with cna[n] / 0). 

Obviously, A^ G jB?, . Also, since for each T € T the union of all cells hitting W is 
contained in some [n], we have An t BT as n -^ co. Furthermore, if T € An then the 
inner window inr„i(s,T^,) (defined in ()2.29p ) contains W for all s, so that 

G[„](5|Tj„] ) = U^,(Tw,o,B) 

for all B € Bw- Suppose now that 'Hr°i (P) < oo and recall the notation of Definition l3.4l 
Writing P|^i(- |Tr\ ) for a regular conditional distribution of vrj^, under the condition 

TT,', = Tr', , we then have 

[n] [n] ' 

Pjn] ( • ITJ!;? ) « Gfn] ( • ItJ;? ) for P-almost aU T. 

Therefore, ii B e Bw is such that PU^{B) = then U^{Tw,o, B) = for almost ah T, 
and thus P]^^{B\T°^^) = for almost all T G A„. Hence 



PiBnAn)= f P(dT)Pi^](i?|T;;;p = o. 

-/ An 



Letting ?i — )■ oo through the integers n with ?^P, (P) < oo, we thus obtain that P(-B) = 0. 
So, we have shown that P ^ PTL^ on Bw, and the proof is complete. D 
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Combining the preceding lemma with Theorem 13. 11 we can conclude that P admits 
a global division kernel $. Hence, for each window VF S P, the conditional distribu- 
tion of tt'j^ given Byy under P resp. PII^ are equal to the localised conditional BRTs 
G^^( • |T^^) resp. G^( • |T^^) introduced in Remark 1^5) resp. Example [ZT2l It follows 
that 

(4.18) n^{P) = Jp{dT)n{Gl^{-\T%');G^{-\T%^)). 

This expression can be specified as follows. 

Lemma 4.7. Under ()4.16p . we have for each W £F, 

(4.19) n'^iP) = f dPw{s,Tw) Y. '^i^w{s,Tw,s,c, ■y,l^,)A) . 

Proof. By Lemma 14.61 and Corollary 14.21 $vy is absolutely continuous with respect to 
A; we write ifw for the associated Radon-Nikodym density. Using the equivalence 
of Lemma I4.1f a) and (b) separately for the intervals between the 'immigration times' 
(j2.30p . we obtain the following identity for the Radon-Nikodym density of G^( • |T^ ) 
relative to G^(-|T^^): 



i°g TZ : Z^a! ^T^w) = ^wio, 1; T^^^) - ^Uo, 1; T^) 



(4.20) ^^ '^ 



+ X] ^oS^w{s,Tw,s-,c,H), 



(s,c,H)Gi^(T-) 



where T\y € MT]y, ^(TJ^) is the associated set of division events with c C W, and 
similarly to (|4.ip . 

and %(0, 1; T'^) = J,' ds EeeT^^,^ A((c)). 

Now, n^{P) is simply the integral of ([120]) over P^^(dT^^)G;^^(dTi;^|T^^) = 
Pvi/(dTvi/). By the equation in Lemma 14.1( c). integration of the last term in (j4.20p 
gives the contribution 

dPw{s,Tw,s) V / ^w{s,Tw,s,c,dH)logipw{s,Tw,s,c,H). 

J cel^n he) 

In terms of the measure C ' which is defined by restricting the sum in (j2.36p to 
the cells c G T^ ^^ this can be rewritten in the concise form J dC ' ® A* (pw log ipw 
Likewise, we have 

y"pH/(dTH/)[Al^(0,l;Ti^)-<^i;^(0,l;TH/)] = j dC^''' '"^ (^ K* [l-^w]- 

Consequently, the Piy-integral of (j4.20p is equal to /dC ' ® A* ^(</?iy), and (j4.19p 
follows by recalling (|3.4p . D 

The final step in the proof of Theorem 13.51 is the following. Let /i"(P) be defined by 
(I3:8D . For brevity we write h'^{P)~'^W^-^{P). 
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Lemma 4.8. Under (j4.16p . 

lim /ijf(P) = supC(P) = ^'""(P)- 

Proof. We claim first that liminf„_j.oo /i™(P) > /i'"(P). We pick any < r/ < 1 and 
£ < oo and restrict the sum in ()4.19p for W = [n] to the cells of Tg with midpoint 
in [nrj\ and radius at most I. More precisely, we let L„ C W^ be such that the set 
{[rj\ + X : rE G L„} is a tessellation of the cube [nr]]. (Note that |I/n|'^-) Also, we take 
any < e < 1 — 77 and let n be so large that rj + 1 < en. Then we can write 

C(P) > n-" Y, KnM 

X&Ln 



K,,,Ax) := j dP{s,Ts) Yl ^(^[n](s,T[„],„c, •);l(c>A), 



with 

hn,,^Ax):= [d'P{s,Ts) 

where T^^£{x) is the set of all cells c satisfying m{c) € [r/] +2; and r(c) < i. Now, whether 
or not a cell c G ^[n],s belongs to Trj/{x) can be decided by looking at the restriction 
T[en]+x,s- So, using Remark 14.41 and Jensen's inequality together with the translation 
invariance of P and the covariance equation ()3.3p we find for each x G L„, 



K,ri,ei^) = '^^len]+x{s,T[^n]+x,s) 2-/ / PN.s|T[,„i+,,,(dT 



c£Tsnr^j{x)' 
X / A(dif)e((/.[„](s,T[„]^„c,i/)) 

J{c) 



> / dP[£„](s,T[g„]^J ^ / A(dif)£i((/5[g„](s,T[g„]_^,c,iJ)) 



p 



= n{lr^,C ®$ |B(,„,;lr,,C ^ A* |b(,„, ) . 

In the last expression, yB[£„] is identified with the a-field that is generated by the projec- 
tion Tfrg^i ® id, and T^^i := Tn/{0) is viewed as a set in the product space BT x C x H. 
In the limit as n — >• oo, Perez' continuity theorem for relative entropies (cf. [9l Proposi- 
tion 15.6]) implies that the last relative entropy converges to 

n{tr^^,'&'(^^;lr^,C^^A*)= fdP{s,Ts) Yl ^(^(s,T„c, • );l(c>A). 

By the time-integrated version of the Palm formula (|2.33p and the shift covariance of $ 
and A, the last integral is equal to 

h^,r-=r]'' I dP°(s,T„c)?^($(s,T„c, •);l(c>A). 

J{r{c)<e} 

Altogether, we find that lim inf „,_>.oo ^n (P) — ^r?/) ^^^ the claim follows by letting ry — > 1 
and i ^- oo. 

It remains to show that /ij-f (P) < /i'"(P). By (|iJ6]l and the above, /i"(P) < oo. 
This implies that the kernel $ admits a Radon-Nikodym density relative to A. Applying 
Remark 14.41 and Jensen's inequality as above, we conclude from (|4.19p that 

h'^{P)<n~^ fdP{s,Ts) Y ^(^(s,T„c, •);l(c>A). 

ceTs:cC[n] 

The condition under the sum above implies that m{c) € [n]. Using again (|2.33p in its 
time-integrated version, we thus find that the last expression is not larger than /i™(P). 
The proof is thus complete. D 
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Remark 4.9. The inner entropy of a BRT P in a window 1^ € P can be defined 
by considering the tessehations not only in W but also in some neighborhood of W. 
Namely, if $ is a division kernel for P and r > 0, one can introduce the quantity 

n\^^iP) = I p(dT) ?^(G^-+-^ ( . |t|^+^-°'-*); G^( • |to/)) , 

which is called the inner entropy of P in W with horizon r. Here, the first of the 
conditional BRTs G is as in Remark 14.51 A glance at the preceding proof then shows 
that Lemma 14.81 can be extended to yield 

hm n-'^?^rf°(P) = /i-(P). 

Next we turn to the proof of Theorem 13.61 which is split into two lemmas. 

Lemma 4.10. The inner entropy density /i™ is affine. 

Proof. As noticed after (|4.15p . relative entropy is a jointly convex function of probabilty 
measures. This shows that the inner entropies ^-i^r{P) = T-i(Pw\Pw ®^w) ^^^ convex 
in P, and so is their limit /i™(P). The proof is therefore completed by showing that 
this limit is also concave. So, let P,P' E i3^q, < a < 1, P = aP + (1— a)P', and 
assume without loss of generality that /i™(P) < oo. By Lemma 14.81 it follows that 
/i™(P) < oo for all n. In particular, P[„] <^ Pr\ (g) GAi with a Radon-Nikodym density 

Qn- The Radon-Nikodym theorem further implies that P[„] <C P[„] and P| , ^ P[„] 
with densities /„ and f!^, respectively. It is clear that a/„ + {\—a)f'n = 1 almost surely 
for P[nl- Moreover, it follows that Pr\ = fn Pr^i for a suitable Radon-Nikodym 

density /„' . We conclude that P[„] = {fn9n/ fn )P[^] ® Gj^j. Since /„ < 1/a and 

/ dP[„] log /°'^ = n{v\f^; V\f^) > 0, this gives 

n'^C(P) = y"dP[„]log^<y"dP[„]log<7n + log^. 
Together with the analogous inequality for P', we finally end up with the estimate 
aC(P) + (1-a) C(P') < n-^dPn log^n + o(l) = C(P) + o(l) . 

The result thus follows from Lemma 14.81 bv letting n ^^ oo. D 

As for the topological properties of /i™, we note first that its lower semicontinuity 
is a direct consequence of Lemma 14.81 and the lower semicontinuity of relative entropy; 
recall the discussion below (|4.15p . Since T >—^ l^[i],il is the supremum of bounded local 
functions, it is also evident that the hitting intensity i{-) is lower semicontinuous. It 
follows that the restricted level sets =^e,P,/3,7 (as introduced in Theorem 13. 6p are closed. 
The following lemma, which can be viewed as a refinement of Lemma 14.61 will imply 
that they are in fact compact; as the intensity bound is not needed here, we put /3 = oo. 

Lemma 4.11. The restricted level sets ^0,p,oo,7 Q^re locally equicontinuous in the fol- 
lowing sense: For each VF E P and < 7 < oo and every sequence B^ G Bw with B^ I (J) 
as k ^f 00, one has 

lim sup P(5fc) = 0. 
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Proof. Let M^ G P and a sequence B^ G Bw with i?fc | be given. Pick some e > 
and consider the events An defined in ()4.17p . By definition, An G B9-, q. In particular, 
P{An) depends only on the initial distribution of P, which is P for all P G ^e,P,oo.-y 
So there is an n with P{An) > 1 — e for all P G ^0,p,oo,7- 

Next, each P G =^e,P,oo,7 admits some division kernel $, and ?^|"i(P) < rj := n 7 
by Lemma 14.81 Since 

^in](P) = /p(dT)?^(Gfj'(-|TJ:;f);Gf„](-|T5;^)) 

by definition and Remark 14.51 we can conclude that the set 

Hn:= {T eMT :n{G'^^\-\T'^;;^y,Gf^^{-\Tl^)) <rj/e} 

in B,\ has measure at least 1 — e for P. It follows that 

[n] 

P(Sfe) < 2e + P{Bk nAnnHn) = 2e+ f P(dT) G^f*' (^fclT^;]^) 

because in[„](s,T?i) D W for all T G An and all s; recall ()2.29p . The next step is to 
use the inequality (|4.15p . For T G An fl Hn, this inequality shows that 

{v/e')G^^-\Bk\Tl^) < (r?/e) +log|dn^(T^,o, • ) exp [iv/e')lB,] 

since GAi( • |T?i) = 'n.yy{Twfl, ■ ) on Bw when T G ^n- Inserting this into the previous 
inequality we find 



Pei 



sup PiBk) < 3e + ie^T]) f PwidTw) log I dn^{Tw, ■ ) exp [(rj/e^) IpJ 



',P,oc,'y 



Letting k —^ 00, using the dominated convergence theorem, and noting that e was chosen 
arbitrarily, we arrive at the lemma. D 

The preceding lemma verifies the conditions of Propositions (4.9) and (4.15) of [9], 
which imply that ^e,P,oo,7 is relatively compact and relatively sequentially compact 
within the class of all translation invariant BRTs. However, this does not yet imply that 
each limit of a net in ^e,P,oo,7 also satisfies the first-moment condition (|2.3ip . This 
is the reason why we need to introduce the bound ii(P) < /3 which makes this fact 
obvious. The proof of Theorem 13.61 is thus complete. 

4.4 Free energy density, variational principle, existence 

Throughout this section we fix a moderate division kernel ^. Our first item is the 
existence of the energy density. 

Proof of Theorem \3.8[ Let P G ^q be a BRT with a covariant division kernel ^. By 
Theorem I3.ir a) , the inner energy of P in a window VF G P can be written in the form 

(4.21) U^(P; ^)= f dP{s, Ts) Yl [ ^(*' T^' ^' d^) l°g ^(^' '^^' ^' ^) ■ 

•^ ceTs-.ccW^ 

Since $ and '0 a-re covariant, the time- integrated version of the Palm formula ()2.33p 
shows that the last term can be written in the form 

/"dP°(s,T„c)vol(x -.c + xCW) f <Pis,Ts,c,dH) log V(s, T„c, i/) . 
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Hence 

with 

\Sn(P;^)\ <K^ fdP\s,T„c)vo\{xe[l] :c/n + x^[l])$(s,T„c,(c)) 

by (M3). The volume term above is bounded by 1 and tends to as n ^ oo. To 
apply the dominated convergence theorem we thus need to show that the total mass of 
P eg) <& is finite. But the Palm formula and ()4.8p show that this mass is at most ii(P). 
This completes the proof of the first part of Theorem 13.81 and implies the bound on 
|n"^(P;^)|. 

The proof of the second part is similar: The Palm formula gives 

Vi^(P;'^)= [dP\s,Ts,c)yol{x:c + xCW) f A{dH){i;{s,T,,c,H) - l) 

J J{c) 

and thus n~"' Vr™i(P; ^) = v^'^{P;^) + (5^(P;^) with a remainder term 6'^ which, by 
assumption (M4), is bounded in modulus by k'^ times 



/dP^(s,T„c)vol(xG [1] ■.c/n + x(^ [1]). 



By (|2.37p and the dominated convergence theorem, this bound vanishes in the limit 
n —7- oo. The proof of Theorem 13.81 is therefore complete. D 

Remark 4.12. Exploiting Theorem I3.ir a) and the Palm formula (|2.33p in the same 
way as in the first part of the preceding proof, one finds that the energy density can be 
written in the alternative form 

n^'^(P;^)= /"p(dT) Yl logi^{s,Ts-,c,H), 

{s,c,H)£S^{Ty.m{c)€[l] 

in which the division kernel of P does not appear. In particular, it follows that ti™( • ; ^) 
is affine. 



Turning to the proof of the variational principle, we introduce an inner relative 
entropy of a BRT P in a window VK € P with horizon r = n^, relative to ^ as follows: 
If P admits a division kernel $ we set, using the notation of Remark 14.51 



(4.22) n\f{P;^) = /"p(dT)?^(G^^+^'-(-|TU;+^'-'°'°"*);G*^'+^'-(-|Tj|;+^'-'°'°"^)); 

otherwise we set 7i^ (P;^) = oo. (Compare this definition with Remark 14.91 where 
^ = A*.) By the bounded-range property (M2) of *, the conditional BRT G^'^+^'- 
in ()4.22p actually coincides with G^. We then have the following convergence to the 
quantity /i™(P;^) in (fXTT]) . 

Corollary 4.13. Let ^ be a moderate division kernel and r = rq, its range. Then 

^in/p.^N lim n~'^n7^\P]^) 

for all P € ^0. The limit is finite if and only i//i™(P) < oo, and then Equation (|3.12p 
holds. 
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Proof. An analog of Equation (j4.20p gives for each n the identity 

(4.23) n^iP; ^) = -Hl;^{P) - ^°](P; ^) + Vf^^jlP; ^), 

which is a counterpart to ()3.1ip . Also, the estimates in the proof of Theorem 13.81 show 
that the second and third term on the right-hand side are bounded in modulus by a 
finite constant times n . The convergence result thus follows directly from Remark 14.91 
and Theorem 13.81 (together with Lemma 14.61 and Theorem 13. ip . 

Next, suppose that /i™(P) < oo and let ip and ip be the Radon-Nikodym densities 
of $ and ^ with respect to A*. Inserting the explicit expressions for all quantities, we 
then obtain 

/i'°(P) -n'°(P;'f) +t;'°(P;'f) = f dP^ ® A* [- ip + iplog^ - ^logip + i;] 

dP^A* ijgif/ij) = /"dP° ni'^;^), 

which is dSlSD. D 

The variational principle, Theorem 13.91 follows directly from Equation (|3.12p and 
thus from the preceding corollary. 

Next we address the existence problem for BRTs with given division kernel, as stated 
in Theorem 13.101 We still keep a moderate ^ fixed and let r = r^ be its range. We 
also fix an initial distribution P € ^q{T). We will construct a BRT P with initial 
distribution 7ro(P) = P and division kernel ^ as a cluster point of some approximating 
measures P"-''^'^^ 

Specifically, for any n we let n+r and consider the shifted cubes [n\i = [n]-|-ni, i S Z , 
which are separated by a grid of corridors of width r. Let [n], = IJieZ'* W« ^^ their union. 
We introduce a BRT P" for which the cells within the boxes [n]i evolve according to ^, 
while the cells that hit the corridors between these boxes evolve according to A*. (This 
is inspired by the familiar construction of independent repetitions in disjoint blocks, but 
avoids an artificial cutting of cells at the block boundaries by using the STIT process in 
the corridors between the blocks.) Formally, we introduce the projection 

0,OUt _ rp ,^ rpO,OUt ._ f \ \ rpm O TOUt \ 

and define 

(4.24) p" = (pn^)^;^ ® (g) G*j^ . 

More explicitly, P" is defined by its integrals 

fdP-= fpu\dT) H /'Gf;]^(dS.|Tj;-*) /(Tj;-*uUS,) 

for measurable functions / > on BT. By the bounded-range property (M2), the 
conditional BRTs G?, ( • |Tr '?" ) depend only on Tr '?" , so that P" is well-defined. It 
is easily seen that P" is a BRT with initial distribution P and division kernel 

^(s, T, c, • ) if c C [n]i for some i G Z'^, 



(4.25) *"(s,T,c, .) _ , ^ ,. , . . . 
^ ^ V ' ' ' / I A((c) n •) otherwise. 

To achieve translation invariance, we introduce the average 

(4.26) P"'^^ = n-'^ [ da; P" o 1?/ 



' X 



35 



The next two lemmas show that the BRTs P">'^^ belong to a restricted level set of the 
inner entropy density, and thus have a cluster point. 

Lemma 4.14. (a) There exists a constant /3 < oo such that ii(P"'^^) < /3 for all n. 

(b) For every e > there exists some r < oo such that 



y P"'-(dT) |T[i],i| l{|r[„,,|>.} < e for all n. 



Proof. Let «; = Kij,. Since ip < e'^ hj (M3), it follows that each kernel \I/" also has a 
A-density "0" satisfying -0" < e" for all n. With the help of Remark 14. 4^ we can further 
conclude that this bound remains true after localisation to a window Vl^ G P (relative to 
P"); that is, the localised kernel ^^r has a A-density ijjy^ with ■0|y < e.'^- In particular, 
if VF = [1] + x is a translate of the unit cube, then 

(4.27) ^;V(s, T:w,s. c, (c)) < e''A(([l])) =: a < oo 
for all possible arguments. In view of Lemma 14.31 it follows that 

(4.28) y"p"(dT) |r[i]+,,i| < /? := io(^)e", 

and statement (a) follows by averaging over x. 

To prove (b) we still let W = [l]+x and define ei = e/4io(-P)- By Lemma [4. 3t there 
exists a number ri with 

sup /p"(dT) {\Tw,i\-Ti\Tw,o\), < eiioiP) = e/4. 

n J 

For any T2 we then find (by distinguishing whether or not ti|Tvi/,o| ^ ^2) that 

y"p"(dT) (|rH.,i| - T2)^ < e/4 + e^JpidT) \T[^\l{\r^,,\>r,/r,} , 
which is at most e/2 for suitable choice of T2. Setting r = 2t2 and using that 
\Tw,i\ < '^{\Tw,i\ -T2)^ on {\Tw,i\ > r}, 

we then see that 

/P"(dT)|r[i]+.,i|l{|T„,.J>.}<^ 

for all n. Statement (b) thus follows by taking the average over a; € [n]. D 

Lemma 4.15. /i™(P"'''''; ^) -> as n ^ 00. 

Proof. Fix any n and let P" and ^^ be given by (|4.24|) and (|4.25|) . Consider the inner 
relative entropy of P"'^^ relative to ^ in a large cube W = [k] and with horizon r = nii, 
as defined in (|4.22|) . In concise notation, (|4.22|) reads 

As relative entropy is jointly measure convex, we have 

^r^n(p„,av.^) < n"^ / dx Ti'^l^iP''' ; ^) . 

J\n] 
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To estimate this further we note that P" has the division kernel ^^. A combination of 
(BSSl), (I32II, pHI]) and an analog of (|CT9]) thus gives the formula 

•^ c£Ts:cCW+x 

where ?{(*"; *|s,Ts,c) = n{^'^{s,Ts,c, ■);'^{s,Ts,c, ■)) for brevity. We can further 
use that W + x C [k+n] when x € [n]. Altogether, we obtain 

c&Ts:cC[k+n] 

Next, it is clear from ()4.25p that 'H{^";^\ ■ , -,0) = when c C [n]i for some i. 
On the other hand, for any cell c hitting the corridors between the boxes [n]j we have 
■H(^"; ^1 • , • , c) = ?^(A*; ^| • , • , c), which is bounded by a constant. Indeed, the func- 
tion Q{a) defined in (j3.5p is bounded by a multiple of |a — 1| as long as a < e*^*. 
Assumptions (M3) and (M4) therefore imply that 

n{A*;^\-,-,c)= [ A{dH)i;{-, ■ ,c,H) Q{l/i;{- , -^cH)) < k^ 

J{c) 

for some constant k^ < (X) and all c hitting the corridors. 

Now let k = {l—l)n for some integer (. and L^ be such that {[1] + x : x G L^} is a 
tessellation of [In] \ [n],. The preceding estimates then show that 



<K<i,Y dP"(s,T^) |T[i]+^_i| < Kii,p\Le\. 



The last inequality comes from (j4.28p . Letting ^ — ?> 00 and applying Corollarv l4.13| we 
finally see that 

This proves the lemma. D 

Combining Equation (jS.lip with the last lemma and the bounds in Theorem 13.81 and 
Lemma l4.14r a). one finds that 

;jin(pn,av) < (^^ ^ ^/^)^ + 1 =: 7 < OO 

when n is large enough. That is, the measures p*^'*^^ eventually belong to the sequentially 
compact level set ^e,P,/3,7 of Theorem 13.61 This means that a subsequence converges in 
Tioc to some P in this set. We need to show that P has the division kernel '^ . In view 
of Theorem 13.91 this will follow once we have shown that /i™(P;^) = 0. By the last 
lemma, it is therefore sufficient to verify that /i'°( ■ ; ^) is lower semicontinuous on the 
closure of the sequence {P"''^^ : n > 1}. In view of Equation (|3.1ip and Theorem 13.61 
this follows from the next lemma, which completes the proof of Theorem 13.101 

Lemma 4.16. The junctionals u™( • ; ^) and u™( • ; ^) are continuous on the closure '^ 
of the sequence {P"'^^ : n > 1}. 
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Proof. First we observe that the estimate in Leninia [4.14l fb) holds not only for all p^^^^, 
but even for all P € ^. This is because the integral there is a lower semicontinuous 
function of the integrating measure. We further know from Lemma [4 . 6 1 that each P € ^ 
satifies (LAC). Hence, Theorem 13.81 and Remark 14.121 can be applied. 
It follows that M™(P; ^) = JudP for the function 

n(T)= Yl logi^{s,Ts-,c,H) 

is,c,H)eS>{Ty.m{c)£[l] 

on BT, which in general is neither bounded nor local. We will therefore replace u by a 
truncated version 

u,,,(T) = l{|T[i],i|<r} Yl log^is,T,^, C,H), 

(s,c,H)e^(T): m{c)e[l],r{c)<e 

for suitable numbers r and i; r{c) is again the radius of c. The function Ur/ is bounded in 
modulus by K\^t and also local because of (M2). It differs from u by at most Ky^{6r + 6i) 
with the error functions 

(5^(T) = l{|Tjy J>r} |T[i]_l| , ^^(T) = 2_^ l|^(c)>£}. 

is,c,H)£S>(T):m{c)e[l] 

As noticed at the beginning of this proof, we have suppgc^ J 5r dP ^^ as r ^ oo. On 
the other hand, the function 6£ is not larger than 

^KT) = Y l^con[i],il' 

coeTb:con[l]7^0,r(co)>£ 

and Lemma 14.31 gives the estimate 



sup /" <5; dP < e" [ P{dT) V 1 



{r(co)>i} 

for the constant a in ()4.27p because each P € '^ has initial distribution P. This bound 
does not depend on n and tends to as £ — )■ cxd because io(-P) < oo. We have thus shown 
that the restriction of n™( • ; ^) to ^ is the uniform limit of the functions P >—^ J Urj dP, 
which are continuous in tiq^- 

The analogous result for ^^(p'^'^'^; xj/) ig achieved in a similar way by truncating the 
function 

v{T)= f ds Yl f ^{dH){^P{s,Ts,c,H)-l) 

and using (M4). D 

As the proof of Theorem 13.101 is now complete, we turn to its corollary. 

Proof of Corollarv \3.11\. Suppose P G '^q{^) is not extremal in i^e- Then P = aP^ + 
(l-a)p2 for some < a < 1 and two distinct BRTs P\p2 e ^e- By Theorem [321 
Remark 14. 121 and Theorem 13. 9[ it follows that 

= /i'°(P; ^) = a /i''^(P^ ^) + (1-a) /i'°(p2; ^) , 

so that P^,P^ both belong to ^e(^)- Hence, P is not extremal in §^e(^)- D 

Our final observation concerns the uniqueness problem discussed in Remark l3.12l We 
will exploit the fact that, in one space dimension, we always have that ^^^^j' A((c)) = 
A.[{W)) when VF G P and Ty^r G '^w- Consider the following variants of conditions (M3) 
and (M4): 
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(M3') ^ is STIT-bounded, in that ^ < A',j, A* for some constant K^iJ < oo. 

(M4') ^ is S TIT for large cells, in that ^( • , • , c, • ) = A((c) n • ) whenever diam(c) > 
r'q, for some constant r^ < oo. 

Proposition 4.17. Suppose that the space dimension is d = 1. Let P G ^{T) and ^ 

be a division kernel satisfying (M2), (M3') and (M4'). Then there exists at most one 
BRT for ^ with initial distribution P. 

Proof. Suppose there exist two distinct BRTs P, P' for ^ with the same initial distri- 
bution P. Consider the difference measure P = P — P' and fix an interval [k] G P. Let 
g be yBr^i -measurable with \g\ < 1. Using property (c) of Theorem I3.H we obtain for 
each < t < 1 the identity 



with 



gdPi = I dsypf(dT,)Lf5(T. 



Lfg(T,)= V / ^{s,Ts,c,dH)[g{0s,c,H{T^s))-9{Ts)] 



ceTs:cn[k]yt$ 



Now, (M2) and (M4') imply that L*sr(T^) depends only on T^^+rU with r = 2{r^ + r'^). 
On the other hand, using (M3') and the additivity of c i— )> A((c)) we find that 

\lLfg{Ts)\<2K^ ^ A{{cn[k])) = 2K^A{{[k]))=:ak. 

The total variation norm 6k{t) := ||P[^i J| thus satisfies the inequality 

(4.29) Sk{t)<ak [ 5k+r{s)ds 

Jo 

of Gronwall type. (Note that 5k is increasing and therefore measurable.) Since 5k-\-nr{s) < 
2, we obtain by n-fold iteration 

6k{t) < 2 a"(/fc + nr)''t''/n\ < 2 e^ (ate'')'^ 

and thus, in the limit as n — )• oo, Sk{t) = for all t < e := l/(ae'') and all k. Inserting 
this into ()4.29p and repeating the estimate, we obtain that 5k{t) = for all t < 2e and 
all k. Continuing in this way we finally find that 5^(1) = for all k, which means that 
P = P'. D 
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